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PREFACE 


Since 1978 three successful conferences in computational 
fluid dynamics have been organised at Reading by the Institute 
of Mathematics and its Applications. Of the last two, that in 
1981 was specifically on aeronautical fluid dynamics while that 
in 1982 was aimed at exploring the common features, developments 
and difficulties of numerical methods for the whole range of 
fluid flow applications. At that meeting it was proposed that 
the next conference should be expanded to combine the coverage 
of these two. 


The Conference on which the present book is based was the 
result. It was organised by the ICFD (Institute for 
Computational Fluid Dynamics, sponsored by the SERC at the 
Universities of Reading and Oxford) in association with the 
IMA. 


The aim of the conference was to bring together 
mathematicians and engineers in the field of computational 
aerodynamics and computational fluid dynamics to review recent 
advances in mathematical and computational techniques for 
modelling fluid flows. The subject area is very large with 
very many active researchers in industry, government 
laboratories and universities working on a wide variety of 
methods and applications. Thus to give coherence to the meeting 
three main themes were selected:- 


Alternative Data Representations - by which is meant 
the various forms of discretisation that may be used to 
model a fluid - such as particle, spectral or vortex 
models, finite difference and finite element viewpoints 
and alternative choices of dependent variables. 


Adaptive Modelling - that is modelling which in some 
way responds to the features of the flow as part of its 
solution procedure. 


Solution of Systems of Linear and Non-linear 
equations - particularly those arising from discretised 
models of fluid flow. 
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Particular thanks are due to the organising committee, 
Dr. N.E. Hoskin (AWRE), Dr. J. Rae (Harwell) and Professor 
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LOCALLY ADAPTIVE MOVING FINITE ELEMENTS 


M.J. Baines 
(Department of Mathematics*, University of Reading) 


l. INTRODUCTION 


Computational Fluid Dynamics covers a wide range of 
modelling activity. It includes complex codes based on 
traditional methods for difficult and pressing engineering 
problems and, at the other end of the spectrum, the 
development of numerical methodology stimulated by the 
desire for flexible and efficient new algorithms. It is with 
the latter area that this paper is concerned, in particular 
with the development of algorithms which adapt in some way 
to the flow field. By considering moving grids we overcome 
some of the limitations peculiar to fixed grids while raising 
other issues. Here we describe and develop the Moving 
Finite Element (MFE) technique in relation to hyperbolic 
conservation laws. 


We begin in one dimension by summarising the MFE method 
for piecewise linear approximations as originally presented 
in (Miller, 1981) but without his use of penalty functions: 
see (Wathen and Baines, 1985). We then describe an 
alternative formulation, based on element basis functions, 
which is equivalent to the Miller formulation but involves 
only local projections as in (Baines, 1985). From this 
formulation we obtain ordinary differential equations in 
time for the evolution of each element segment in terms of 
its velocity and angular speed. The determination of the 
nodal velocities from the motion of the segments (also local) 
is then described, and the resulting method is used to 


te te pee ee 
*The work reported here forms part of the research programme 


of the Oxford/Reading Institute for Computational 
Fluid Dynamics. 
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derive the decomposition of the standard MFE matrix. 


We then describe singularities of the method. In the 
rather rare event of the nodes becoming collinear (parallelism) 
onè of the local matrices becomes singular, and a special 
non-local procedure is needed which is however simple 
to implement. The only other singularity of the method 
occurs when nodes overtake one another. This arises if the 
time step (in the numerical integration of the semi-discrete 
problem) is so large as to destroy the accuracy. However it 
may also arise (correctly) in hyperbolic problems when a 
shock forms. In that case the appropriate jump conditions 
may be used to continue the solution. 


Time-stepping strategies are then discussed in relation 
to accuracy and practical results. So far the forward Euler 
explicit method has been found to be sufficient, as in 
(Wathen, 1984 and Johnson, 1984). Results are given for two 
non-linear examples, the inviscid Burgers' equation (for 
which the MFE method gives the exact solution) and the 
Buckley-Leverett equation. Both programs run in BASIC on a 
home computer. 


Generalising to systems of equations we describe a method 
which uses separate grids for each conserved variable as given 
in (Baines and Wathen, 1985). Results are given for a shock 
tube problem. Again a small microcomputer is sufficient to 
obtain a reasonable approximate solution to this problem. 


The method has considerable potential in two dimensions, 
see (Wathen, 1984). However care is required in that the 
time derivative VE of the piecewise linear continuous 


function v in two dimensions now belongs only to a subspace 
of the space S spanned by the element basis functions. 


Hence a constrained projection is considered which gives the 
standard Miller method. The approach again demonstrates that 
the MFE matrix has a decomposition similar to that in one 
dimension. A local segment velocity can also be derived 
which assists in determining the rules for shock formulation. 
Results are given for a non-linear problem in two dimensions, 
a generalisation of the Buckley-Leverett equation. 


2. STANDARD MFE APPROACH 
We first describe the standard Moving Finite Element method 


as presented in (Miller, 1981). We begin by approximating u 
in the scalar equation 


ae ae Oe 
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+£ = 
u, (a). o (2.1) 
by the piecewise linear function v given by 


v=2LaQ 2.2 
uit ( ) 
where the Ek are nodal coefficients and the a, are linear 
J 
basis functions as shown in Figure l(a). 
1 


(a) (b) 


Fig. l. Basis functions (a) ch and (b) 8, 


For fixed finite elements es depends on time t and a 


depends on the space co-ordinate x. In the MFE method 
a, depends also on the time t through the nodal co-ordinates 


S4 (i = 0,1,..-,N+1). In order to study the solution of the 


conservation law (2.1) we differentiate v with respect to time. 
Since t appears twice in the expression for v we have 


Me 3" 


=f (åa, + a,d,) (2.3) 
j 

where the dot denotes differentiation with respect to time. 

Now, by the chain rule, 


Ja 
= i m (2.4) 
j i i 


so that (2.3) becomes 
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Yt 


=rå a, +Ia, I$ —+ (2.5) 
j 3d gaar 


By interchanging the order of summation we obtain 


= (åa, + &.8.) , (2.6) 
Tg u? ( j 58a) 
J 
where 

da, 
B= la, m (2.7) 

j i ðs, 

J i j 


is a second type basis function dependent not only on t and on 
8, but also on a, the vector of nodal coefficients. From 


(2.2) we may write 


Baie (2.8) 


which in the case of piecewise linear basis functions can be 
written (after some manipulation) as 


B, = -ma (2.9) 


ae 
where m is the local slope of v: (see also (Lynch, 1981)). 


Thus the second type basis function 8 j has components 
which are multiples of GA and have the same support as ye 


(The result is true for linear basis functions in any number 
of dimensions (see (Baines, 1985)). Diagrams illustrating the 
a, and B, in one dimension are shown in Figure l. 


J j 


Hence in one dimension 


v, =E (åa, + 8.8.) (2.10) 
PF 


a ia Jpj 


belongs to the space of piecewise linear discontinuous 
functions Sug! say. 


EE f(v) also belongs to sy we immediately obtain from 


B 


(2.1) ordinary differential equations for a, and 5} in time. 
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For example if 


f(u) = ; u (2.11) 


the ODE's are 


å, =0 5, =a 


j j j” (2.12) 


which are readily integrated to give 3 and #4 In this 


particular case the trajectories of the nodes are the 
characteristics of the partial differential equation as in 
(Wathen, 1984). This is an exceptional case however. 


More generally we may project EV into the space SaB by 


minimising the Ly norm of the residual 


lly, + ev) 11, (2.13) 


over the variables a, and šj: This leads to the double set 


of Galerkin equations 


p E + v) > = 
<A f(v) [0] 
v, (2.14) 


<B., v, + £(v) =O 


J pie 


Substituting for v from (2.10) we obtain the matrix 
system of ODE's 


ay =g (2.15) 
where 

x = Lee rdyrBy eee] (2.16) 

<a, B> 
y en [Sha am 3 ae! 

A= {ays}, As, 
<Byra,> <B; +84? 
a, 

(2.18) 


a a = {g,} oF ‘s dia e t 
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The matrix A is symmetric block 2x2 tri-diagonal, positive 
semi-definite, depending on y. Equation (2.15) gives the 
MFE equations as derived in (Miller, 1981) without the 

use of penalty functions. 


It has been shown in (Morton, 1982) that the MFE equations 
(2.15) carry the best least squares fit to the exact 
solution in the case of a scalar conservation law. 


3. LOCAL APPROACH 


We now adopt instead a local elementwise approach. The 
function v in the space of piecewise linear discontinuous 


functions may be re-parameterised in the form 


)=f (w 


Strat iter t katka; D 


using element basis functions Pki as shown in Figure 2. The 


Wi are the coefficients of the Ms in the expansion which 
can be related to the a8, (see below). Denote by Sy the 


space spanned by the basis functions tki’ this is the same 


space as Sa in the one-dimensional case. 


B 
1 1 
$ $ 
kı k2 
Ski Skz 


Fig. 2. Basis fanctioņs yy and oy 0 


(a) on an element, (b) around a node. 


Again, if £(v) belongs to S, (= ) we obtain Wa Y) 


S 
¢ ap k2 
at once and consequently ay and Bie More generally we may 
again project £(v) into Sy This can be done locally 


within each element by minimising the local element Ly norm 
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lly, + em II, (3.2) 
over We We thus obtain alternative Galerkin equations 
<$; V, + f(v) >=0 

kl t x (3.3) 
<t.o7 Ye t £(v) > =0 
for each element which can be written in the form 
Ck woe by. , (3.4) 
a 2x2 system. In (3.4) 
Tre 
w [w Wa] (3.5) 


<M? Skr’ Pen” 
ck = (3.6) 


Sokar tr? Stz’? 


$ 
and b =<, , -£(v)> - (3.7) 
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To relate the W'S to á and 55 we use the fact that 


Oy e E ak er By e o a yee? ge! 
(3.8) 


where elements j-}4, j+4 are adjacent to node j, as in 
Figure 2. Then using (3.1) we have the correspondence 


E E 
W- (3.9) 
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This can be written in the form 


Mj x =W (3.10) 
another 2x2 system, where 
¥ = là] (3.11) 
a N A 
Be | i B (3.12) 
j j+ 
Jiz (3.13) 


Since both the Miller method and the local elementwise 
method minimise the same residual in the same space the 
MFE equations derived from the two methods must be identical. 


It follows that, by writing 


i yl ee (3.14) 


we have from (2.15), (2.16), (3.11), (3.10), (3.4), (3-7), 
(3.8) and (2.18) the decomposition of the global MFE matrix A 


A= m'em ' B-15) 
where both C and M.are diagonal block 2x2 matrices (apart 
from end effects). 

A particular property shared by the methods is 
conservation. Since both the ye and the ti S are a 
partition of the unit function, summation of the first of 
(2.14) or (3.3) gives 
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SN+1 


Iv, + f(v) lax =0 , (3.16) 


So 


from which we may deduce that 


a SN+L Sy+1 
a feel a 


Sue 


and consequently that Í y dx is constant in time apart 


So 


from boundary effects. 


Boundary conditions may be imposed locally on the element 
adjacent to boundaries as in (Baines, 1985). In the case of 
a Neumann condition at the end j = O we have 


85 = fe) (3.18) 


which, in conjunction with the second of (3.9), gives 


a = W1 (3.19) 


and hence the motion of the boundary element. 


If the boundary condition at the end j = 0 is Dirichlet 
then, because we cannot impose both 


&..= 0... a. =0 (3.20) 


simultaneously and preserve the projection, a special 
constrained projection has to be carried out in the end 
element. The result is that 


OUNI = 3 b, o/s =s) 1 (3.21) 


where b, 2 is the second of (3.7) for the end element. This 
, 
is consistent with (3.20). 
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4. LOCAL ELEMENT MOTION 
We now show that the motion of a local segment of the 


approximating piecewise linear function may be obtained 
entirely from the local projection step (3.4). 


y 


k + 


Fig. 3. Local segment angle and velocities 


Let Vk be the velocity of the mid-point of the segment in 


the direction perpendicular to the segment and let k be the 


angle between the segment and the x-axis, as shown in 
Figure 3. Then the equations (3.9) can be written in 
the form 


ayy cosé,. = Soy sin®, = wW cos6, | 
Vk 


aay cos, ~ Sey sin®, = W2 cost, 


(4.1) 


where nodes k-4, k+4 are the ends of element k, as in 
Figure 3. The left hand sides:of (4.1) are the velocities 
Vk-4' Veti (due to the motion of the single element k only) 


of the ends of the element k in Figure 3 at right angles to 
the element. (The full velocity of a node will be a 
combination of two such element end velocities from adjacent _ 
segments) . 


2 es 7 kk 
Sty 7 Sk 


Let (4.2) 


Then these end velocities may be written 
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[v W, b, 
kk kl 
e 1 Si [PSs EE u as 


cosé, = =a RON 
Mets]? [Mea Vam) Pho Var) mer (Sezer as 


(4.3) 


using (3.4) and (3.7). Since [1,1]? is a eigenvector of the 
symmetric matrix C, with eigenvalue 4(s,,, - Sp) = 4ds,, say, 


k 
we obtain 
[1,1] i! APAA Bibi ti ieee 
? Yeu V) Ask a E ? 
S, 
k+ 
z r <E > = pnra ikii 5 £(v) dx (4.4) 
Y(n) k As ym) 75,5 
Hence 
Vy = aV + Yal ae e UEa E a 
be SS Yim) Sk 


(4.5) 
which gives the normal velocity of the mid-point of the 
segment in Figure 3. Putting Ag. = LEV) un - (E) pa} 
this may be written in the alternative forms 


A£, Af, Af, 


=- — =- —*=-— 4. 
Vk is, cos a PO iy sin 8 (4.6) 


where Av, = day = akis - ay and PQ are as shown in Figure 3. 


Thus we have the important result that the speed of the 
mid-point of the segment in Figure 3 in the direction normal 
to the segment is consistent with the local average speed 


A£ /dv,. in the element. 
Subtracting pairs of the equations (3.9) we obtain another 
important result, namely, 


= Wo T Mp (4.7) 
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or, using (4.2), 


k 1 
eet Sy = -8 


Since [-1 1] is also an eigenvector of C, with eigenvalue 
l 4s, we obtain 


ek 
dm, <o,, £(v)_> Skt 
6 kl, e a ER - £(v) dx 
RTS as RP a ai= saa (katk? 
at (sx) <p E? Psg spa x 


(4.9) 


This leads to the alternative forms 


ae a 


= a) A, mas 2 " 
Tan Mig a (i) £ ox (My) =-m, £ "(n) (4.10) 


dt Es] 


where 
a l Skt ps 
Eia is, f f(v)dx, f= AUE) yy + (EV) pag) 


k-4 
(4.11) 


and "Ny = (Sy age Skay) © 


Hence we have the result that the rate of change of the 
slope of the solution in an element is equal to the second 
space derivative of the flux function. In other words the 
solution segment rotates in response to the local convexity 
of f. Another form of this result is 


2 
dé. ~ 
k Tk z 
T ATEST f my?) . (4.12) 
Lt m 


As the segments move the intersections (nodes) also move, 
giving the nodal velocities. Note that the segments have 
lengths that vary with time. The movement of a node is thus 
the locus of the intersection of adjacent elements (see 
Figure 4). 
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s; a 
j 5j 
Fig. 4. Nodal movement as locus of segment intersection 


The results (4.6) and (4.10) show clearly the element 
behaviour in terms of the flux function. 


5. SUMMARY AND PARALLELISM 


We give now a summary of the local elementwise method 
which is complete provided that none of the matrices 
involved are singular. We then go on to discuss the 
treatment of such singularities. 


Summary 
(1) Evaluate by = <hr -£(v) > 
(2) Solve Cm, = be for wy V elements k 


(3) Pair off the w's nodewise 
5 5 oo) Cet 
(4) Solve M, nE =W for x, = (a, /8,1 Vnodes j. 


Note that the method involves the inversion of 2x2 matrices 
only: little storage is required and the algorithm can be 
run on a small computer. 

The method may break down if any of the matrices Gk or M, 
is singular. Singularity of i corresponds to collinearity 


of nodes, usually described by the term parallelism. This 
occurs when 


Moa = My (5.1) 


a igs >, Singularity of Q arises only if As = 0 (see 


43 
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Consider singularity of “m first. In this case we find 


that we obtain inconsistent solutions of the pairs of 
equations (3.10). As a result we can no longer solve 
equation (3.10) locally. The remedy is to temporarily fix 
any parallel nodes, solve over a patch consisting of these 
nodes and their neighbours and relocate the parallel nodes in 
some averaged way as in (Wathen and Baines, 1985 and Baines, 
1985). It is convenient to return to the a,8 basis by 
combining equations (3.4) in staggered pairs. Then in the 
event of parallelism at a single node we retain the 
combination of these equations corresponding to the basis 
function a6 and replace the second by 


$, = 0. (5.2) 


This gives 

1 $ l 

A Oe) 5 oh + 3 45-2,5) "54,0 + 345 44,5) ¥5 43,2 

(5.3) 


1 
TEAR IN = 
6 jae) e 5k, Pasa 


Sin $, = h å, =w, = . i 
ce now ai O we have 4, W552 W543, from (3.9) in 


the reduced problem which yields 


a 
es Bsa + Pjg, 7 giS 4, 1+ 4543.5) 54, oF (5.4) 
j 
La + A. ys) 
Begnk® t Bjs 
B =0 


as the solution for the modified system. 


The null space of the singular matrix Mj is spanned by the 
T 
vector [m,1]" (where 7m = Mu, =m) and an appropriate 


multiple of this vector may be added to satisfy an externally 
imposed averaged velocity or position. 


If several nodes become parallel simultaneously a number 
of equations of the type (5.3) will occur and it may be 
necessary to solve a tri-diagonal system if the nodes are 
adjacent to one another. 
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Parallelism is a rather rare occurrence and is usually 
associated with the curvature of the solution changing sign 
during the evolution, or perhaps with the evolution to a 
steady state. 


We go now to consider singularity of a Since this type 


of singularity is associated with node overtaking as a result 
of time stepping we discuss it in the next section in 
association with time integration. 


6. TIME STEPPING AND NODE OVERTAKING 


The MFE method is semi-discrete and gives rise to ordinary 
differential equations in time which require integration to 
obtain the full solution. 


We have already seen that in the case of the inviscid 
Burgers' equation the nodes move along characteristics, 
while for the general scalar hyperbolic law the least 
Squares best fit to the exact solution is carried asymptotically 
for small time steps. Approximate time stepping will degrade 
the latter property if the time step is too large, while the 
former property will be lost for more general flux functions. 


It has been found that using the Euler explicit forward 


difference method is sufficient in the examples tried so far 
by Wathen (1984) and Johnson (1984). In no case do implicit 
methods give any advantage. We therefore use 


n+l) o ntl _ on =l at lat SL 

aj =X % + At Mj i aj + At Mj My 

n+l n (6.1) 
8, By 


(see (3.10)). 


As far as the choice of At is concerned we still require 
an accuracy criterion. Algorithms for accuracy are not well 
developed but in view of the simplicity of the method we can 
afford to be generous in taking a trial and error approach. 

A possible algorithm compares the result of one MFE step with 
that of two half steps and continues halving the step until 
the difference between the two results is acceptable, as in 
(Baines, 1985). 


A major difficulty with time stepping is that the nodes 
may overtake one another if At is not small enough. 
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In problems whose solution is expected to be smooth this gives 
an implicit condition on the time step. For hyperbolic 
problems which admit shocks however we expect discontinuities 
to form and we can take advantage of node overtaking to 

model shocks in an effective way. Moreover the condition on 
At is now explicit in that it is the shortest time taken 

for one node to catch up with the next as in (Wathen and 
Baines, 1985). 


As the separation of nodes goes to zero the element 
segment becomes vertical (parallel to the u-axis) and from 
(4.6) with a > m/2 the normal velocity of the mid-point of 


the segment tends smoothly to the shock speed, at least in 
the semi-discrete case. The shock speed may then be "frozen", 
ie. imposed on both nodes of the shocked element: this acts 
as an internal boundary condition and the solution on the 
adjacent’ elements to left and right may proceed separately. 
The procedure is also feasible when nodes run into shocks or 
when shocks overtake shocks, in which cases a node can be 
deleted. 


From equation (4.12) we see that the angular speed of 
the segment is non-zero when the shock forms (my. > œ) so that 


there is a change of state at this instant. The manner in 
which the shock forms in the MFE method is consistent with 
the (Oleinik, 1956) entropy inequality in the semi-discrete 
case. Moreover this is also true for expansions. 


Accuracy in space is of course determined by the number 
of nodes used to represent the solution and this is decided 
when carrying out the projection of the initial data into 
the piecewise linear space. Less obviously it has been 
found that it is crucial how these nodes are distributed in 


space, equidistribution of wr)” being an effective choice: 
see (Herbst, 1982). 


We show numerical results for two scalar problems, the 
inviscid Burgers' equation and the Buckley-Leverett equation. 
These are shown in Figure 5, The first gives the 
exact solution for a convex flux function while the 
second is an approximate solution in the case of a 
non-convex flux function. 


(b) 
Fig. 5. MFE solutions for the scalar non-linear equation 
LRG» f =O. 


; t 
(a) f(u) = hu" (Burgers') 
2 
(b) £(u) = > (Buckley-Leverett) 
u + 4(1-u) 


with initial data as in (Wathen and Baines, 1985). 


17 


18 BAINES 


7. EXTENSIONS TO 1-D SYSTEMS 


In extending the above ideas of systems of conservation 
laws we are faced with an immediate decision. Should we 
work with separate nodal coefficients and a common mesh or 
should we give each component of the system its own mesh 
with individual nodal coefficients and co-ordinates? 


Where discontinuous features are expected to occur 
simultaneously for all components of the system, as in the 
Euler equations, there is an argument for using a common 
mesh. However the useful algebraic structure of previous 
sections is only preserved when each component is given its 
own mesh. 


One possible strategy is to use a single mesh whose 
movement is determined by a single preferred component of the 
system as in (Baines and Wathen, 1985). For example, in 
the Euler equations we might choose the density as the most 
significant component and use that to drive the nodes. The 
remaining components are then determined on a prescribed 
moving mesh. 


The difficulty here is that the flux function in the 
density equation is the momentum, which is itself piecewise 
linear on the same mesh as the density. By (4.10) it follows 
that the slopes of the element segments of the density do 
not change with time, which is much too restrictive. 


We therefore concentrate here on a model which uses a 
different mesh for each component of the system. As a 
result of the independence of each mesh we can easily solve 
the MFE equations in the manner of earlier sections, once 
the right hand sides have been set up. Thus the only new 
feature is the quadrature in equation (3.7) which links the 
components of the system through the evaluation of £(v). 

In one dimension the elements can be subdivided suitably 
with an elementary quadrature over each sub-element. 


The only other difficulty arises in shock modelling. 
A feature of a shock in gasdynamics is that components shock 
simultaneously and this is not guaranteed in the numerical 
method. An additional device is therefore needed in general 
to ensure that when a shock occurs it is simultaneous in 
the appropriate components. 


We give numerical results for the well known shock tube 
problem used as a basis for comparisons by (Sod, 1978). 
This is shown in Figure 6. 
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DENSITY VELOCITY 


-1000 02 04 O06 O8 1.01.1-1000 02 04 O06 O8 1.01.1 


PRESSURE ENERGY 


-1.00.0 02 04 06 08 1.01.1 -1000 02 04 06 08 1.01.1 


Fig. 6. MFE solutions for the Sod shock tube problem at 
t = 0.144 from initial data at t = 0.1 as in (Baines and 
Wathen, 1985). 


8. THE METHOD IN HIGHER DIMENSIONS 
One of the most promising aspects of the MFE method is 


its straightforward generalisation to higher dimensions. 
We consider here the two dimensional scalar conservation law 


u tf, +g, Oil's (8.1) 


t Nf 

We again approximate the function u by a piecewise linear 
function v given by equation (2.2), where the basis functions 
are now two dimensional "pyramid" functions as shown in 
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Figure 7. The time derivative YE of the function v now 


however belongs to a subspace Sug of the space of piecewise 
linear discontinuous functions S, on the two dimensional mesh, 


$ 
because it has to correspond to a continuous v and not all 
members of Bi do so. This is because there are generally 
more elements surrounding a node than there are nodes at 
vertices of an element (see Figure 8). 


1 


(a) (b) 


Fig. 7. Basis functions (a) a, and (b) B, in two dimensions. | 


ki k2 


Fig. 8. Nodes/elements in two dimensions. 


From a local elementwise point of view we can readily 
calculate the b, in the two dimensional generalisation of 


equation (3.4), which is now a 3x3 system. But in order to 
obtain the nodal velocities, which are evaluated from the 
union of equations (3.10), we require W , the union of the 


$ WSs to lie in the range space of M (see (3.14)), which 


is now rectangular. 
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It is therefore necessary to constrain the projection which 
leads to the vector w. The result of this constrained 
projection is that w satisfies 


Mow = Mb (8.2) 
which, since Mb =g (8.3) 
leads to the familiar form 
Tira 
MCM =g, (8.4) 
ie. it is equivalent to the standard method (see for 
example (Wathen and Baines, 1985)). 
Hence the decomposition 
A = MCM (8.5) 


holds in higher dimensions, but although the matrix C is 
square 2x2 block diagonal (using elementwise numbering) 

the matrix M is rectangular. For example, in two dimensions 
using nodewise numbering M takes the form N where 


jl jl 
N = diag {n,} Nj = j1 -m52 “Ayo (8.6) 
z amir “Nyy 


where manja are the slopes Vly of the function v in the 
x and y directions within the element. 
Using a permutation matrix Q to map between the 


elementwise numbering and the nodewise numbering we obtain 
the consistent decomposition 


A = NOON , (8.7) 
where M = QN. 


It has been shown by Wathen (1984) using this decomposition 
that if D is the matrix consisting of diagonal blocks of the 4 


MFE matrix A then the eigenvalues of the matrix pia lie 
in the real interval 


~oan iVe BER ARD 
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[z 1+ 4 r (8.8) 


where d is the number of dimensions. There is therefore 
every reason for using a conjugate gradient method with 


D ` as preconditioner to invert the MFE matrix A. 


It is interesting to note that the MFE equations (8.4) also 
arise from first carrying out an unconstrained projection for 
w into the space Si and then doing a least squares 


projection of the equation 


Ny =w (8.9) 


weighted by the matrix co as in (Baines, 1985). 


If S, = B then by a calculation similar to that in Section 


4 we again find as in (Baines, 1985) that Vk’ the velocity of 


the centroid of the triangular segment of the-solution in the 
direction perpendicular to the segment is given by equation 


(4.6), where now 


af = the outward flux of (f,g) across the (8.10) 
element boundary, 


ayy = the vertical profile of the element k (8.11) 


and the angle G is given by 


tan 6, = |vv] . (8.12) 


There is also a corresponding result on the segment rotation 
similar to equation (4.10) but apart from noting that the 
convexity of the flux function is again involved we do not 
set down the details. 


9. SINGULARITIES IN HIGHER DIMENSIONS 


Because of the decomposition (8.5) the singularities of 
the method in higher dimensions correspond to those in one 
dimension, namely, they are singularity of C when the 
element size goes to zero and rank deficiency of M when 
nodes become coplanar: see (Wathan and Baines, 1985). 


i . In the latter case of parallelism the remedy is the same 
a as ‘in one dimension, namely to remove the (linearly dependent) 
ee equations causing the parallelism and to solve a reduced 
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system, adding a suitable multiple of the null space at 
the end. Some care is required with the larger blocks 
occurring in higher dimensions; it is safest to transform 
the local system to upper triangular form, which avoids 
problems of ill-conditioning which can arise if an 
arbitrary equation is omitted. 


Singularity of C arises when the size of an element goes 
to zero as a result of a node running into the opposite side 
of a triangle. Even though we do not have a proved best 
fit property in more than one dimension we may still 
conjecture that this occurrence corresponds to the formation 
of a shock. Technical difficulties arise in determining how 
the shocked triangle should move subsequently but we have 
the following lead in the case Sy = Sag” 

We recall the result stated above on the normal velocity 
Vk of the centroid of the triangle segment perpendicular to 


itself, It can be seen that this velocity tends to the 
local shock speed as the segment becomes vertical in the 
sense that the triangle sweeps out "mass" at the correct 
rate. It appears that we should therefore impose this 
shock speed on all points of the vertical line through the 
centroid of the triangle in the subsequent motion with the 
shock in place. This still allows possible rotation of the 
triangle about the line through the centroid which must be 
determined from the solution of the rest of the system. 

We do not go into the details here. 


We show results for one problem, a generalisation of the 
Buckley-Leverett equation in one dimension. The governing 
equation is 


a (8.13) 


=0 


CEAN 


t 2 


u + (1a)? 


and Figure 9 shows both the initial data and the solution at 
a later time. It is worth noting that even though the 
triangles of the mesh become highly distorted there is 

no ill-conditioning of the MFE matrix. 
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Fig. 9. MFE solution for the two dimensional equation (8.13): 
initial data and solution (isoplots) as in (Wathen, 1984). 


10. CONCLUSION 


In this paper we have shown that the MFE method in 
one dimension is a local method which gives the motion of 
individual element segments in terms of elementary properties 
of the flux function. As a result non-linear problems 
involving scalar conservation laws can be solved simply on a 
small computer. Shocks are particularly well resolved by the 
method, the entropy rules being respected. 
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In higher dimensions the local nature of the method is 
constrained in such a way that an additional least squares fit 
is required to map the local element motion onto the 
nodewise velocities. The technical problem of dealing with 
shocks is assisted by the consistency of the element segment 
velocity with a local wave speed. 
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FINITE ELEMENT METHODS FOR COMPRESSIBLE FLOW 


R. Löhner, K. Morgan, J. Peraire, O.C. Zienkiewicz and L. Kong 
(Institute for Numerical Methods in Engineering, 
University College, Swansea) 


l. INTRODUCTION 


In this paper we review the on-going research work at 
Swansea into the application of finite element based methods to 
the solution of problems involving compressible fluid flow. 
From the outset, our main area of activity has been the 
development of algorithms for modelling supersonic flows, with 
the ultimate aim of producing a computer code for predicting 
aerodynamic heating rates over bodies of complex geometrical 
shape at high Mach number. 


The finite element method can be used to discretise 
accurately any arbitrary-shaped region but the consequence is, 
in general, that the resulting finite element mesh will be 
completely unstructured. An example of such a mesh is given 
in fig. 1, which shows a typical discretisation of a 2D region 
using 3-noded triangular elements, and it may be observed how 
the mesh is lacking in structure e.g. the number of elements 
surrounding a node varies with position in the mesh. 


The use of the finite element method in this manner should 
ease the problems frequently associated with discretising 
geometrically complex domains, as the process of mesh generation 
is decoupled from the solution algorithm, and will enable local 
refinement of the mesh to be made in a straightforward fashion. 
However, as a consequence, We must restrict consideration to 
solution algorithms which can be implemented on such 
unstructured meshes in two and three dimensions. This means 
that optimal implicit schemes, such as those employing 
factorisation methods (e.g- MacCormack (1981) or Beam and 
Warming (1978)), cannot be used and neither can methods relying 
on line or surface relaxation (e.g. Jameson (1983)). In 
addition, methods which are based upon the extension of strictly 


28 LOHNER ET AL. 


1D concepts into two and three dimensions cannot be considered 
(e.g. Colella (1984)). 


va) 


OOS 


POLS 


o> 


Fig. 1 A typical unstructured finite element mesh 


The approach chosen is therefore the use of a simple 
explicit time-stepping scheme with the capability of producing 
time-accurate solutions for transient flows and with the hope 
of coupling the scheme with a multigrid procedure (such as that 
employed by Ni (1981)), to obtain rapid solutions to steady- 
state problems. A computer code based upon such a scheme will 
be capable of a high degree of vectorisation and vector/scalar 
speed-up ratios of around 20/1 will be obtainable on a modern 
supercomputer such as the CYBER 205. However, it must be 
accepted that a program using the finite element method on an 
unstructured grid will be capable of achieving only 0.1-0.25 
of the maximum Mflop rate for such a machine, because of the 
penalty associated with the large amount of gather and scatter 
Operations which are required. For this reason, when the 
solution of large realistic problems is attempted using this 
approach, it is essential to try and ensure that the best 
accuracy is obtained for a given number of grid points. Our 
hope is that this goal can be attained by the incorporation of 
adaptive refinement procedures. 
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2. INVISCID FLOWS 
2.1 The Solution Algorithm 


We begin by considering problems governed by the two- 
dimensional compressible Euler equations in the form 


ou OOF. 
Ergat I 
att Ox (2.1.1) 
where the summation convention has been employed and 
u, 
p P j 
u= eral Coils T i Pa (21.2) 
pig A n2 OES 
pE uj (pE + p) 


Here p, p and © denote the density, pressure and specific total 
energy of the fluid respectively and u is the component of the 


fluid velocity in the direction izi of a Cartesian coordinate 


system. The equation set is completed by the addition of an 
equation of state 


1 
p = (y-l)p [Ee - z uju] (263.3) 


which is valid for a perfect gas, where y is the ratio of the 
specific heats. 


Our initial approach (Lohner et al (1984a)) to the solution 
of this equation set was an implementation of the second order 
Euler-Taylor-Galerkin method introduced by Donéa(1984). The 
resulting algorithm takes the form 


oF 2 oF” ON 
m+l m. j At miaa ai 
ju -U™)N aù = -At d m a-a afa x, Ox, 
Q j S 5) 
2 oF. 
ne m iy. ar (2.1.4) 


where Q is the solution domain, I is the boundary curve of 2, 
n=(n,, n,) is the unit outward normal to I, and 


dF, 
-= (2.1.5) 

AL a 
In this expression the superscript m 


is the jacobian matrix. 
t and the timestep 
m 


denotes an evaluatior at time t = 


åt=t -t . The solution vector U and the flux vectors F; 
m+l m = 
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are nodally interpolated using 3-noded linear triangular 
elements, with nodal shape functions Nye while a piecewise 


constant representation is adopted for the jacobian matrices, 
with element shape functions Po: The integrations in equation 


(2.1.4) may then be performed exactly» and lead to an assembled 
equation system of the form 


m ŝu = £" (2.1.6) 


where M is a consistent mass matrix and 6U is the vector of 
changes in the nodal values of U over the timestep. To maintain 
the explicit character of the scheme, this equation system is 
solved using the iteration process 


miou- suit) SMe -m gut }) (2.1.7) 


(o) 


starting with SU’ = 0, where M, is the lumped (diagonal) mass 


matrix. This process converges in three iterations and is 
employed when computing true transients or steady solutions to 
problems involving flow at high Mach number. If only steady 
state solutions at low supersonic Mach number are of interest, 
then one pass through the iteration procedure is used and this 
is equivalent to solving equation (2.1.6) directly with M 
replaced by Me In this case, in 1D the scheme reduces to the 


Lax-Wendroff method. 


Finite difference workers have consistently avoided the use 
of one-step schemes of this type because of the CPU penalty 
associated with the evaluation and multiplication of the 
jacobian matrices in equation (2.1.4). We can proceed in a 
similar fashion here and develop a two-step scheme which aims to 
remove these troublesome terms from the algorithm of equation 
(2.1.4). In the first step, we use the Taylor expansion 


m 
oF 
ms om At 5 
ee ale i, (2.1.8) 


and approximate ys in a piecewise constant fashion, with uv" 
m 
and F, interpolated as before. The elementwise constant values 


of we can be obtained from the weighted residual statement 


m 
oF 
mHz m At k 
U P a= -> 
ae aan 2 NESTA 
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The solution of this equation system completes the first step 
of the procedure. 


In the second step we note that, correct to first order, 


or” or” 
Hy pm, At =k pm _ At am 
er aera aa ty S 
and hence 
m+; m, 
n aS pa (2.1.11) 
*x Ox, TE ske 


A weighted residual form of this expression can now be used to 
obtain an alternative representation for the troublesome terms 
appearing in equation (2.1.4), which is then replaced by 


m+l m 
U = = 
al U 1N ian 


mts m my =m, 
At alF ag dt IE nar- ae pen -F )Ni al 
(2.1.12) 


where the overbar denotes an average element value, In this 
expression the normal flux H is defined by 


Ea ny 5 (2.1.13) 
and the boundary conditions are applied by ensuring a correct 
interpretation of the boundary integrals: see (Peraire et al., 
1985). Again an equation system of the form of equation 

(2.1.6) results and this can be solved by the iterative process 
of equation (2.1.7). In 1D, this two step scheme reduces to 
the method due to Burstein (1967). 


When the equation system (2.1.1) is linear, the one and two 
step schemes can be shown to be exactly equivalent. For the 
Euler equations, the two schemes appear to be numerically 
equivalent. The two step method is therefore to be preferred 
in practice, due to its speed and beneficial vectorisable 


properties. 
2.2 Artificial Viscosity 


When the two step scheme is used for the analysis of 
problems involving strong shocks, it must be stabilised by the 
incorporation of an artificial viscosity. This is accomplished 


m+ 
by replacing the computed values uv 4 by smoothed values of Us 
calculated according to 


1 
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= ee CTE 2.2. 
Ue uU ry} ry} ) ( 1) 
where 2 denotes the direction of the local maximum rate of 
change of the absolute value of the velocity i.e. 


g= grad |ul/ |graalul| (2.252) 


and the diffusion coefficient, Ree , is calculated as 


ge PAT 
k =c |g Wl (2.2.3) 


In this expression, C is a constant whose value must be 
specified and h, is a representative element length. This form 


for the artificial viscosity is rotationally symmetric and 
reduces to the well-known model of Lapidus (1967) for 1D 
problems. In addition, this model should also be useful when 
viscous problems are modelled, since it will produce only a 
small smearing of boundary layers, where u.£*0O. An 
investigation of the behaviour of this model and a comparison 
with’ familiar models is given in (Löhner et. al., 1985a). 


2.3 Stability 


The explicit character of the solution scheme implies that 
it will be subject to a Courant-type stability criterion. In 


practice, the timestep At is chosen so that 
sah 
e 


At <$ uae (2.3.1) 


where c is the local sound speed, a = 1 when one pass through 
the iteration scheme of equation (2.1.7) is employed, a = 1/v3 
otherwise and s is a safety factor. 


2.4 Domain Splitting 


For purposes of computational efficiency and accuracy, a 
domain splitting procedure has been proposed (for full details 
see (Löhner et. al., 1984b) which groups together elements which 
allow approximately the same time step. The solution is then 
advanced with different timesteps on different portions of the 
mesh in a time accurate fashion. This technique is employed 
in all the numerical examples reported here. 


2.5 Adaptive Refinement for Steady Problems 


The two step solution algorithm described in the previous 
sections has been coupled with an adaptive refinement procedure 
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which is designed to improve the quality of the solutions 
calculated for the steady state Euler equations. The basic 
idea is that the solution is advanced towards steady state on 

an original mesh and then, at prescribed times, the mesh is 
automatically refined in an adaptive fashion and the computation 
continued. In (Löhner et. al., 1985b) the authors investigated 
the use of mesh refinement via mesh movement. However, because 
of its additional flexibility, we are currently favouring the 
use of local mesh enrichment and it is this algorithm which will 
be described here. 


We should like to ensure that if E is the error in our 
calculation then 


max || € ||° ——> minimum (2.5.1) 
e 


where ||.||° denotes some suitable measure of the error over 
element e and the maximum is sought over all elements in the 
mesh. Following Oden (1983) we attempt to achieve the 
satisfaction of this condition by requiring that 


||z || = constant (2.5.2) 


on each element. There are several possible methods for 
obtaining a suitable measure of the error and these have been 
discussed in (Löhner et. al., 1985c). However, our experience 
indicates that the local use of classic theoretical error 
estimates works satisfactorily and is very economical in the 
finite element context. For the Euler equations, the density 
p is identified as the key-variable and the measure of the 
error in the solution is taken to be 


[|e 1° = le - 6l® (2.5.3) 


where |.|° denotes the L, - norm over element e, and 6 is the 


approximate solution. From interpolation theory (see e.g. 
(Strang and Fix, 1973)), we can then write 


k 
Iel|? «cm [ele (2.5.4) 


where EIE denotes the kth seminorm over element e, h, isa 


representative element length and C is a constant. Equation 
(2.5.2) is now replaced by the requirement that 


h lely = constant (265.5) 


or, in practice , since the exact solution for the density p is 
unknown, 
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Ke ea: 
hy llk = constant (2.5.6) 


A mesh enrichment algorithm can be devised based upon the 
satisfaction of this requirement. At a prescribed stage in 
the calculation we determine 


= k e 
y = max ho llk (2.5.7) 
and then refine all elements e for which 
k e -= 
ho lel >ay (2.5.8) 


where a is a constant whose value must be specified. The 
refinement of an element is accomplished by adding a node to 
the mid-point of each side and so sub-dividing the element 
into four. There will be some problems associated with 
"hanging nodes' but these can be removed by surrounding the 
region to be refined with a layer of transition elements and 
sub-dividing some of these elements into two, as required 
(Löhner et. al., 1984c). After refinement, the mesh can be 
smoothed locally and any badly deformed elements which remain 
are removed (Löhner et al., 1985b). Alternatively, a mesh 
reconnection algorithm can be used to prevent the appearance 
of unacceptably large angles in the mesh and the need for 
smoothing can then be avoided. 


The value that should be used for the constant a in equation 
(2.5.8) has been the subject of some experimentation. We are 
currently choosing the value by using a method similar to that 
proposed in a finite difference context by Dannenhoffer and 
Baron (1985). In fig. 2 we plot a typical curve showing the 
variation in the ratio of the number of elements refined to 
the total number of elements in the mesh for different choices 
of a. For an economical refinement process, the value of a 
used is chosen to be in the vicinity of the point a*, which is 
the point where the line, which is parallel to the line joining 
the two ends of the curve, is tangent to the curve. 


The value of k remains to be specified and either the use 
of k=l or k=2 are possibilities here. With k=l, the 
requirement becomes 


26 $ agè- 
ho [ al dx, Ox, ag] * = constant (2.5.9) 


or, since the gradient of 6 is constant over each element, 


2 
Rh H |_ = constant (2.5.10) 
i e XxX, e 
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With k=2, the requirement is that 


2 2 
2 
he [ a | aê 3 ô an]? = constant (23 SiN 


where the second derivatives are interpolated in terms of nodal 
values which are obtained by a variational recovery process 
(Löhner et al., 1985c). The behaviour of both requirements 
has been investigated and our current preference is for 
equation (2.5.11), because of the quality of meshes which are 
produced. 


elements refined N 1 


elements total 
Fig. 2 Determination of the value of a* for the refinement 
procedure 


2.6 Numerical Results 


To illustrate the performance of the adaptive mesh 
refinement procedures which have been introduced above, we 
consider the solution of two problems involving steady high 


Speed inviscid flows. 


The first problem is that of Mach 3 flow past a wedge of 
angle 20°. The initial mesh, and corresponding density 
contours after 100 steps, is shown in fig. 3a. The adaptive 
refinement procedure with local mesh smoothing produces the 
refined mesh of fig. 3b and the density contours produced — 
after a further 100 steps on this mesh are also shown. This 
mesh is now refined and smoothed and the resulting mesh, 
together with the final solution produced, is shown in fig. 
A nice feature of the mesh enrichment procedure is that the 
initial calculation can be made on a relatively coarse mesh - 
all we need ensure is that the mesh is sufficiently fine to 


3c. 
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Fig. 3 Steady inviscid Mach 3 flow past a wedge. Sequence of meshes and corresponding density contours. 
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Sequence of meshes and corresponding pressure contours. 


Fig. 4 Regular shock reflection at a wall. 
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pick up the main features of the flow. The significant 
improvement in solution quality with mesh enrichment is 


immediately apparent. 


The second problem is that of regular shock reflection at 
a wall and is an example which has been studied previously by 
Harten (1982) and Colella (1984). The initial mesh, the 
boundary conditions applied and the resulting pressure 
contours are shown in fig. 4a. The refinement process is now 
entered, using the reconnection algorithm and no mesh smoothing. 
The refined mesh produced and the corresponding pressure 
contours are shown in fig. 4b. A further refinement is 
performed, leading to the mesh and converged solution shown 
in fig. 4c. Again the improvement in solution quality is 
apparent. The steepening of the shocks as the mesh is refined 
can be clearly seen in fig. 5, which plots the converged 
pressure distribution along the line x, = 0.5 for each of the 


meshes. 
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Fig. 5 Variation of pressure along the line x, = 0.5. 
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3. VISCOUS FLOW 
3.1 The Solution Algorithm 


The compressible Navier-Stokes equations can be written in 
the form 


ou OOF oG 
ioe pial atest 
ot a ax, ax, (3:1-1) 


where U and Ei are defined previously in equation (2.1.2) and 


o 
c. =]715 (3.1.2) 
“3 o. 

2j pe 

15 + u92) Fk ox, 


Here k denotes the thermal conductivity of the fluid, T is the 
temperature defined by 


l 
Tso [e - 5 uu] (3.1.3) 


c, is the specific heat at constant volume and the components 


of the viscous stress tensor o, j are given by 


ðu ðu a 
tie yee peel heg et ame (3.1.4) 
ij = ox, ox, ox, ij 
For present purposes it is assumed that the Stokes hypothesis 
is valid so that the viscosity coefficients A and yu are related 
by 


2 
r=-2u (3.1.5) 


The solution approach adopted here is to view equation (3.1.1) 
in an operator-split fashion, treating the viscous terms ina 
simple explicit manner and the advective terms as before. The 
result is that equation (2.1.12) is replaced by 


oN 
m+l m. Hs i m 
= fees = F ar 
il (U U )N a At Ai aQ - At of Ny 


Ox. 
j (3.1.6) 


ms =m. m Ni m 
5 i 5; — ag + At G N.dar 
At Je F )N ar At ls 3x} | i 


and the solution ere may again be determined by a two-step 
approach similar to that outlined above. 
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3.2 Stability 


The timestep limit used comes from a direct generalisation 
of the result obtained by applying these ideas to the solution 
of the linear one dimensional convection-diffusion equation. 
With this equation written in the form 


2 
BO Fig OP Nee 828" (35262) 
ot ox axe 


the scheme will be stable, with a lumped mass matrix, provided 


that 


h? 


e 
At < Tab + an] (352.2) 


The generalisation to the Navier-Stokes equations replaces a 
by |ul+ c and b by max (u/p, K/pc.), where c, is the specific 


heat at constant pressure. For the computations to be reported 
here, the Prandtl number, Pr, has been taken to be constant and 
equal to 0.72 which implies that é 


(w/e) $ = 0.72 (3.2.3) 
( face) 


In practice, therefore, a stability requirement of the form 


2 
sh, pc 


at < ea empa] ACON (3.2.4) 
u KEA 


is employed where s is a safety factor. 
3.3 Numerical Results < 


The first problem considered is the steady state solution of 
supersonic viscous flow over a flat plate. The flow conditions 
correspond indentically to one of the cases considered by Carter 
(1972), using a finite difference scheme. The free stream Mach 
number is 3 and the Reynolds number based on the plate length is 
1000. The temperature of the plate is assumed constant. The 
Sutherland viscosity law (see White, 1974) is used and the 
initial conditions are chosen to be appropriate to the case of a 
flat plate impulsively inserted into the free stream. Free stream 
boundary conditions were enforced along the inflow boundary, 
whereas extrapolation type boundary conditions (see (Peraire 
et. al., 1985)) were applied to the outflow and upper boundaries. 
A symmetry type boundary condition is applied to the portion of 
the boundary ahead of the leading edge of the plate. The mesh 
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used, with 1051 elements and 581 nodes, is displayed in fig. 6a 
and the general features of the solution can be appreciated by 
examining the density contours shown in fig. 6b. Comparison 
of the computed wall pressure distribution with the results 

of Carter (1972) is given in fig. 7. Fig. 8 shows profiles of 
velocity, temperature and density at the outlet section. The 
agreement between the two solutions is good, with a small 
discrepancy being apparent in the profiles of u and p. 
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(a) 


(b) 

Fig. 6 Steady viscous compressible flow past a flat plate at 
Mach 3 and Reynolds number 1000. (a) Mesh, (b) Density 
contours. 


The second problem considered is the Mach 3 flow over a 10 
wedge with an isothermal wall. The free stream conditions are 
the same as those used in the flat plate calculation but here 
the computational domain has been enlarged resulting ina 
Reynolds number of 1.68 x 104 relative to. the distance between 
the leading dge and the compression corner. The identical 
problem has also been studied by Carter (1972) and Hung and 
MacCormack (1976). The problem specification and the mesh used 
are shown in fig. 9a. The computed velocity vectors and 
contours for density, pressure and temperature are displayed 
in fig. 9b-9e. 
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Fig. 7 Computed wall pressure distribution for viscous 
compressible flow past a flat plate, compared with the 
results of Carter (1972) 
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Fig. 8 Computed outlet profiles compared with the results of 
Carter (1972) for viscous compressible flow past a flat 

plate (a) velocity parallel to the wall, (b) velocity 
perpendicular to the wall, (c) temperature, (d) density- 
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3.4 Adaptive Refinement for Steady Problems 


The refinement procedure outlined previously does not appear 
to be ideal for viscous flow problems, as it tends to 
continuously refine shocks, whereas in certain problems we may 
require a better definition of other flow-field features, such 
as boundary layers. In this section, therefore, we describe 
some preliminary attempts at producing refinement indicators 
which are capable of identifying shocks and boundary layers 
and so enable these features to be separately refined if so 
desired. 


Fig. lOa shows an idealised situation in the vicinity of a 
shock and it can be seen that the direction of the vector 2 
defined as in equation (2.2.2), will be approximately normal to 
the shock and hence the quantity du, /32 will be significant 


near the shock, where u, = u.%. This suggests that we replace 
the requirement of equation (2.5.10) by 
2 dup 
a | = = constant (3.4.1) 
e ak e 


if we want to give preference to shocks in the refinement 
process. 


For boundary layers, the situation:is somewhat different 
and fig. lOb shows an idealised flow in the vicinity of a wall. 
Now the direction & is almost normal to the wall, while it is 


the quantity du, /ak which will be significant, where u, = u.t 


and t is a unit vector such that &.t =O. A refinement 
requirement which gives preference to boundary layers might 
then be 


du 
re | ane |_ = constant (3.4.2) 
e! dk e 


To illustrate the performance of these refinement 
indicators we return to the problem of steady compressible flow 
past a flat plate considered previously. The solution shown 
in fig. 6b and obtained on the mesh of fig. 6a is subjected to 
the requirement of equation (3.4.1). The resulting mesh is 
shown in fig. lla, which clearly gives a refinement in the 
vicinity of the shock. When the same solution is subjected 
to the requirement of equation (3.4.2) the mesh which results 
is shown in fig. llb, in which a portion of the boundary layer 
has been refined. Employing both criteria simultaneously 
produces the mesh shown in fig. llc and the corresponding 
converged solution for the density on this mesh is displayed in 
fig. lla. 
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Clearly, much remains to be done in this area and improved 
indicators will no doubt result, but these initial 
investigations have shown much promise. 


shock 


*] ba! 


(a) 


“ee. 


(b) 


Figure 10 Direction of % and t in the vicinity of (a) a 
shock and (b) a boundary layer. 


Fig. lla-lld 


Steady viscous flow past a flat plate, (a) Refined mesh 
produced by shock criterion, (b) Refined mesh produced by 
boundary layer criterion, (c) Refined mesh produced by 
combining these two, (d) Corresponding density contours. 


(figure facing) 
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4. MULTIGRID METHODS 


We conclude with a brief description of some initial j 
experiments performed on the use of multigrid methods to | 
accelerate further the convergence rate for inviscid steady 
state problems (see e.g. Ni(1981), Jameson (1983)). A nested 
sequence of grids can be easily generated, from a coarse 
discretisation of the solution domain, by successively refining 
into four each element in the mesh. Following Ni(1981), we 
then attempt to use this sequence of grids in such a way that 
the fine grid accuracy is maintained while rapidly propagating 
the fine grid corrections through the mesh via the coarser 
grids. For illustration, consider the use of two grids, a fine 
grid with representative length h and a coarser grid of 
representative length 2h. With the fine grid correction 


av (aunt u™® ) calculated via equation (2.1.12) and | 


(2.2.1), the one step scheme of equation (2.1.4) can be used 
on the coarser grid with the approximation 


m jh h 
- =| - x (4.1) 
j At 
The result is that 
t 2h 2h2 ¢ anc? 
al avy, Pan = SE A aut, Pan - SE) alal aw Pol 
At 2at E 
2h 2 
(At) m) 2h 
PUAN SS EIS ot 
n i ml ay y Par (4.2) 
2At Sia ge 


The correction, w”, calculated on the coarse mesh can be 
interpolated onto the fine mesh, the fine mesh solution updated 
and the steps repeated until convergence results. The extension 
to the case of several grids is straightforward. As an example 
of this scheme in operation, we consider the solution of the 
inviscid Burgers equation using 5 grids. The problem is that 
considered by Jameson (1983) and the convergence is shown in 
fig. 12a while the effect of varying the number of grids used 
on convergence rate is shown in fig. 12b. The solution of the 
Euler equations in this fashion is the subject of current 
research. 
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Fig. 12 Multigrid solution of Burgers equation, (a) Convergence 
to steady state using 5 grids, pictures every two 
cycles, (b) Residual decrease using different numbers 
of grids. 
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TEST 


(c) 


Fig. 13c 


Fig. 13 Solution of potential flow past a cylinder using an 
unnested multigrid procedure, (a) Finest mesh and 
problem boundary conditions, (b) Coarser meshes 
employed, (c) Converged solution. 


The use of a nested sequence of grids appears to conflict 
with our intention to accurately represent the solution 
domain with the finest, unstructured, grid. Therefore, a 
preliminary investigation has been made into the feasibility 
of using a sequence of unnested grids. We have begun by 
considering the solution of elliptic problems in this 
fashion and the initial results are encouraging (Löhner and 
Morgan (1985a, 1985b)). Here we demonstrate the procedure 
applied to the problem of determining the stream function for 
potential flow past a circular cylinder. The region analysed 
together with the sequence of unnested grids used is shown 
in fig. 13a and fig. 13b. The solution shown in fig. 13c was 
converged to 1076 in 30 work units. This approach clearly 
offers advantages for finte element based procedures and will 
receive further attention. 
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A BASIS FOR UPWIND DIFFERENCING OF 
THE TWO-DIMENSIONAL UNSTEADY EULER EQUATIONS 


P.L. Roe 
(College of Aeronautics, Cranfield Institute of Technology) 


1. INTRODUCTION 


One of the unifying themes suggested by the organisers of 
this conference is the study of algorithms which adapt 
themselves to suit the problem being solved: Such an 
adaptation is the hallmark of upwind-differencing schemes. The 
theory of such schemes has reached a high level of 
sophistication in the context of hyperbolic systems of partial 
differential equations in one space dimension. That is, in 
seeking numerical solutions to 


wt Alw) w= 0 (1.1) 


where w is a vector of unknowns, and A(w) is a matrix with a 
complete set of real eigenvectors. It is well known that each 
eigenvector corresponds to a pattern of disturbances associated 
with the passage of waves of a particular family. The various 
upwind schemes consist of "pattern-recognition" mechanisms which 
identify these waves in the data, followed by "evolution" 
mechanisms which use these patterns to update the solution. 

The design principle which they all invoke is that information 
should be propagated numerically in the same direction as it 
would propagate physically. Thus, left- and right-moving waves 
have to be distinguished and treated differently. It is in 
this way that the algorithm adapts to the data. 


The topic addressed in this paper is the nature of the 
analogous process in computing two-dimensional flow (and, 
although we do not treat the details, higher dimensions also). 
Since wave propagation is then possible in infinitely many 
directions, we need somehow to select a finite set of 
directions for special treatment in the algorithm. Loosely 
expressed, out of the infinite set of possible patterns, we 
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seek the most significant. This difficulty is present even Tf 
the equations being studied are linear, so this will be the 
only case studied here. Our results relate chiefly to the 
Euler equations which describe inviscid compressible 
gasdynamics, although a similar analysis for other systems of 
equations would be possible. 


2. ONE-DIMENSIONAL ANALYSIS 


To point out the contrast with the two-dimensional case, 
we must first review the well-known analysis for one-dimensional 
flow. Since this paper is limited to a linearised analysis, we 
study 


wy tAMW = o (2.1) 
where A is a constant matrix. In principle, w can be chosen 

to comprise any three independent flow quantities, but certain 
choices both simplify the algebra, and illuminate the results. 


T 
We choose w = (p,u,p) , where p = pressure, u = velocity, and 
p = density. Then the matrix A is 


u pat o 
a=|% u o (2.2) 
ie) pP u 


where a = sound speed = (yp/p) 2, y = ratio of specific heats. 
The constant quantities which appear in A represent some 
average state of which the flow is a small perturbation. A 
simple wave solution of (2.1) is defined to be of the form 


w(x,t) = w(x - At) (2.3) 
and substituting this into (2.1) gives 
(A - AI)dw = O (2.4) 
showing that dw is an eigenvector of A, and à the corresponding 
eigenvalue. Clearly there are three sorts of simple wave 


solution, corresponding to the three eigenvectors of (2.2), 
which are 


a (2.5) 
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with eigenvalues 


A, =u-a AVS =a AZ,zuta (2.6) 


The complete solution to the initial-value problem 
(given w(x,0), find w(x,t) for t > O) is easily written in 
terms of these basic solutions. The data is uniquely 
representable as 


k=3 
w(x,0) = È wit) (x) zh) (2.7) 
k=l 
where the ht) are scalar functions of x. 
The solution is then found as 
k=3 
sete thse Baw Pas ANY) (2.8) 
k=1 


Such a straightforward process cannot be applied to 
nonlinear equations,but by using techniques based on a local 
linearisation, successful numerical algorithms (Roe 1981, 
Harten 1983), employing the above ideas, can be created. 
These algorithms differ very little between the linear and 
non-linear cases, so that although in this paper we confine 
our attention to the linear case, we may be hopeful about 
making the nonlinear extensions. 


The first stage of the algorithm is the "pattern-recognition" 
stage. From the data wi = w(idx,ndt), given over some range 


of i at a particular time level n, we compute the set of 
'fluctuations' 
n n 
Us T ra 


and find their projections onto the eigenvectors of A. 


(k) _(k) (k) 
ere me a (2.10) 


In (2.1.10) the {a} are local wavestrengths. 
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They are easily evaluated as 


(1) 


aja E Sa bap - padu ] 
Se = 2 ER ENA (2.11) 
evel = zal + padu ] 

where A(+) = (Jay - (), > 


We can think of these local wave solutions as a set of 
model flows which are equivalent to the data. Knowing how 
these model flows would evolve over a short period of time 
guides our choice of the evolution algorithm. There are at 
least two possible choices. In one dimension they are 
conceptually different but lead to identical results. In two 
dimensions, as we shall see, they are much more distinct. 


In one dimension the simplest approach is the 'fluctuation- 
signal' concept (Roe, 1981, Roe and Baines, 1981, 1983). Each 
component of the fluctuation is thought of as associated with 
a signal equal to 


At (k ,(k) _ (k) 4 (k) 
FS hp ee oN 
where yi) is the Courant number of the a wave. Then if yd) 


is positive, the signal is subtracted from Nia! and if yh is 


Negative, the signal is subtracted from Wie When all signals 
have been despatched, the solution has been updated to time 
level (n + 1). 


An alternative approach is to define a set of interface 


n 
states we and then to update the solution by means of 


s wae a (2.12) 


E Pe Es a 
a St Ax 5 
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Note that (2.12) is most usually applied to conservation laws, 
for which A dw = dF, where F is a flux vector, so that (2.12) 
is written 

n+l PERE + n+5 

“oog ax p Z-s | > Cersa) 


In the present instance, we are working chiefly with non- 
conservative variables, so that no F exists, but (2.12) can 


still be used. To use the model flow to find Wea! it is 
usually assumed (e.g. Harten 1983) that the states Wye Wet are 
constant either side of x = (i+s)Ax at t = nât, so that the 


flow coincides locally with the solution to a Riemann problem. 
On the interface x = (its)Ax, for t > nåt, the state isa 
constant one which differs from wy by virtue of the left-running 
waves, and from Mia by virtue of the right-running waves. 

We can easily show in fact that 

(k) (k) | (k) 


comer (2.14) 


nt+4 ) aes 


n n 
Magy EE a) eee EA 


k 


With this definition of the interface states, it will be found 
that the algorithm (2.12) gives exactly the same result as the 
'fluctuation-signal' algorithm. 


Starting with these basic ideas, the one-dimensional 
algorithms can be refined considerably. When the problem is 


nonlinear, the z% and a are no longer constant, but must 
be evaluated within each interval (Roe, 1981). The schemes 

can be modified to achieve higher-order accuracy without 
sacrificing monotone behaviour at discontinuities (Harten, 1983, 
Roe and Pike, 1984). Versions exist which can be guaranteed to 
produce the physically-correct (i.e. entropy-satisfying) 
alternative, whenever ambiguous solutions arise (Osher, 1984, 
Tadmor, 1985). 


In the present paper, however, we wish to avoid these 
complexities, concentrating instead on a more basic issue 
which arises when we attempt to apply the upwind concept to 
flow in two (or more) space dimensions. This is the problem 
of ensuring that the numerical processes really do imitate 
accurately the richer variety of flow phenomena to be found in 
two dimensions. Difficulties associated with the pattern- 
recognition stage are discussed in Section 3, and with the 
evolution stage in section 4. 
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3. TWO-DIMENSIONAL ANALYSIS 
3.1 Simple wave flows 


To analyse the two-dimensional problem, we retain the same 
variables, but add a second velocity component, so that w = 


urvo and the equations of motion become 


wtAw, ape at? (3.1) 
where 
u par oOo vo as o 
a= |% u 00 B= 0 v o (3.2) 
o o u O Ao fe) 
o O uj; o o v 


Simple wave solutions of (3.1) are of the form 
w(x,y,t) = w(x cos® + y sin® - At) (3.3) 
which implies that 
(A cos6 + B sind - AI)dw=0. (3.4) 
Hence dw is an eigenvector of (A cos@ + B sin@) and à is its 
eigenvalue. (For more detail on simple wave solutions in the 
non-linear case, the reader may consult the recent monograph 


by Majda, 1984). 


The eigenvectors of (A cos® + B sin8) are of three kinds. 
Acoustic disturbances are represented by 


x, (8) =| a cosé (3.5) 


with eigenvalue 


à (8) =ucos6 + v sinô +a. (3.6) 
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‘Entropy waves', across which the pressure and velocity do not 
change, are represented by 


(3.7) 


= 


r 
-e 


© OO: O 


with eigenvalue 
re = u cosd + v sind . (3.8) 


Finally, 'shear waves' across which the tangential component 
of velocity changes, and which have no analogue in one- 
dimensional flow, are represented by 


o 
r = |-a sinô (3.9) 
Zs 
a cos® 


o 


with the same eigenvalue (3.8), i.e. XS = AB 


Yiat j 


Fig. 1 The geometry of plane unsteady waves in two-dimensional 
flow. 
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The geometrical interpretation of the eigenvalue is shown 
in Fig. 1. Disturbances which pass through x = O, y = O, at 
t = O with constant velocity can be displayed in coordinates 
(x/t, y/t). On such a diagram the fluid velocity is the point 
(u,v) and the Mach circle has centre (u,v) and radius a. An 
arbitrary tangent to that circle represents an acoustic wave 
moving with speed à (9) - An arbitrary line through the centre 


may represent either an entropy wave or a shear wave, moving 
with speed rer 


Clearly, the search for simple wave solutions has much in 
common with the search for characteristic surfaces (Courant 
and Friedrichs, 1954). Indeed, a plane in (x,y,t) which can be 
a surface of constant state in a simple wave is also a plane 
within which a characteristic equation can be written. There 
is, however, a_crucial distinction. Characteristic analysis 
is concerned only with the equations of motion, and tells us 
only that certain paths may be used to transmit information. 
It does not tell us which paths are actually being used, and 
to obtain that information we need both the differential 
equation and the data to which it is applied. Since the 
number of possible paths is infinite, corresponding to the 
arbitrary tangents and diameters in Fig. 1, some restriction is 
needed if the theory is to be of assistance in numerical work. 
Davis (1984) sought a single dominant direction which would 
indicate the presence of a shockwave, and obtained an 
encouraging improvement in the quality of shock-dominated 
calculations, 


Our proposal is to use simple wave analysis to provide a 
local model of events in the flow, which may be matched to 
patterns observed in the data over some small part of the 
computational domain. We assume that over any region 
sufficiently small that the variation of the data is accurately 
represented by a linear function, the data may be replaced by 
a 'model flow' which is a superposition of simple waves. It 
will be the evolution of this model flow that guides the 
evolution stage of the algorithm. 


The model flow will be one which matches the data both as 
regards its mean state (this happens automatically in the 
present linearised study) and as regards the spatial 
derivatives Wer wh (which comprise eight scalar quantities). 


The model flow must therefore contain eight free parameters, 
but it is not obvious what set of parameters should be chosen. 
The following properties seem natural requirements: 
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(i) If the data are sampled from a flow which is one- 
dimensional in some direction other than the coordinate 
directions, the model flow shall be the one-dimensional flow 
in that direction. This requirement recognizes that two- 
dimensional flows contain one-dimensional features (such as 
shocks) and asks that our description of them shall be, in a 
sense, coordinate-free. 


(ii) Whatever data are provided, the process of matching the 
model to them must be fast and direct. This remark is made 
because it invalidates many previous, unpublished efforts by 
the author where the parameters of the model turned on solving 
a cubic equation, or worse. Even if the use of such models 
improved the quality of the results, they would be too 
expensive for use in practical codes. 


(iii) The parameters of the model must be real-valued for 
arbitrary data. This obvious requirement has also been a 
source of past failures. For example, a natural model might 

be composed of two acoustic waves, one entropy wave, and one 
shear wave, which is indeed how any steady supersonic flow will 
appear. Matching the parameters of this model to arbitrary 
data (as is done in Section 3.3) leads to a set of eight 
equations whose solution depends on the roots of a quadratic 
equation. Unfortunately it is easy to construct data for which 
real roots do not exist. 


3.2 Operator splitting 


The simplest possible model flow is one which is merely the 
sum of two one-dimensional flows in the coordinate directions. 
Formally, we set 0 = O or 0 = 4m in (3.5) to (3.9) to obtain 
eigenvectors 


pa? o o 
ikt (x) 
zD = |+a |, meet BISOI LEa A Ci BA (3.10) 
o [°] a 
P Pp o 
If we set 0 = + 1/2, we obtain 
pa o ° 
(yH) _ (y) _ ih ake 3.11 
EEN Oo}, x Oj}, x al. ( ) 
+a [e] o 
p Pp o 
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These are merely the one-dimensional wave patterns, with the 
additional feature of shear waves. For example, a discontinuity 
(x) 
in v, i.e., (E 
the x-direction with velocity u. One way to generate a model 
flow is to assume that the x-gradients are accounted for by 
(3.10) and the y-gradients by (3.11). If we have data at the 
four points ABCD as shown, we may approximate 


) appears as a vertical line propagating in 


D A 
pecan £ b 
E onstage iG EDL (352) 
ml weet ao we) (3.13) 
a Cy nS 2A) AN Bier ness TD 
and set 
e (x-) (x) (x) (x+) 
Wy agr + GPE + GE + Or. 4 (3.14) 
We (y-) (y) (y) (y+) 
Wm Bi Ee: yt Pate ie tsk oie * Byker) se (3.15) 


In effect, this is what is done in many existing codes which 
apply upwind methods to multidimensional problems (Sells (1980), 
Lytton (1984), Woodward and Colella (1983)), but it does not 
meet our first requirement . Suppose that the data ABCD is 
taken from a region across which an acoustic wave is 
propagating at an angle 6. Both Y and k will be multiples of 


£, (8) (see eqn (3.5)). 


The calculation (3.14) will result in 
A 2 


pa pa o 
uss k |a cos6| = šk(l - cos6) |-a | + k sin6] o 
a sind o a 

p p o | 

pa 


+ &k(1 + cosé@) (3.16) 


a 
o 
p 
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and will indicate non-zero strengths for three waves whose 
velocities are u - a, u, u +a, instead of a single wave with 
velocity (u cosé + v sin@ + a). This will frequently result 
in spurious signals being sent in inappropriate directions. 


We conclude this section with the remark that the above 
objection to operator-splitting would be totally lacking in 
force if the matrices A, B in (3.1) happened to commute. It 
would then follow (Strang, 1980, p. 193) that A and B shared 
the same set of eigenvectors, so that the pattern-recognition 
problem would then be the same in any direction, since 
A cos® + B sinô will also have the same eigenvectors. In 
practice, A and B seldom do commute, In the present case of 
the Euler equations, the sets (3.10), (3.11) have only one 
common element (unless a = 0). It is worth bearing this point 
in mind, since theoretical studies of operator-splitting 
(e.g. Crandall and Majda, 1980) have considered mostly scalar 
problems. Even in that case, non-linearity introduces 
difficulties, but the additional complications due to lack of 
commutativity cannot be revealed. 


3.3 Alternative models 


Fig. 2 The waves used to generate the discrete flow model. 


To avoid the undesirable features of operator splitting, it 
seems necessary to leave at least some of the wave directions 
unspecified, as part of the eight free parameters of the model. 
The following model, taken from (Roe, 1985) has useful 
properties. The acoustic part of the fluctuations in the data 
is represented by four plane waves (3.5) travelling 
orthogonally to each other (see Fig. 2). The four amplitudes of 
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these waves (aj ragrag) and the one independent direction (0) 


contribute five parameters to the model. An entropy wave (3.7) 
of unknown amplitude 8 and the direction $ contributes two 
more, and the one remaining parameter can be taken as an unknown 
vorticity, w. Note that on the small scale where all 
variations appear linear, uniform vorticity cannot be 
distinguished from the effect of two shear waves, of equal 
strength, intersecting at right angles, but otherwise of 
arbitrary orientation. This observation allows the effects of 
vorticity on the evolution of the flow to be handled 
numerically in the same way as the effects of acoustic and 
entropy waves. 


If the flow gradients due to each assumed component of the 
model are added together and equated to the gradients observed 
in the data, the following system of eight equations is 
obtained: 


a cosé + a cos9 - a sing - a sind = Pa 7 SLTA) 

asin + asing + %,cos@ + a,cosé = Py 7 (3.17) 
2 2 2 a 

a cos 0 -= acos 0 + asin 6 - asin 6 = Ug eek oC), 


a »sin@cosé ~ a,sin@cos® - a,sin@cosé + a,sin@cosé - tw/a 


2 
=U., 3.174) 
k ( 


a, sin@cosé - a sinêcos® - a,8in@cosé + a,sin®cosé + w/a 


= vy, y (3.17e) 
asin? - a sino + acos” 9 - a4cos” = Xy PRS 07E) 
a cos@ + acoso - asing = sind + Bcos¢ = Re Pe SeL7g) 
a) sing + asin + 43cos@ + a,cos® + Bsing = Bey Sees 7H) 


Here, the gradients are nondimensionalised as follows 


E ET. E T E 


p] 1 etc. 
pa“ 3x a ox p ox 
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Note that these equations are linear in the six unknowns 


(4 15 157% 4080) but non-linear in the two unknowns (0,$). 


Such non-linearity is the inevitable consequence of admitting 
unknown directions into the model. In principle, it means that 
a set of real-valued parameters matching the data may not exist, 
or may fail to be unique. Actually, equations (3.17) do have 

an extremely simple explicit solution which gives real, unique, 
and physically significant, parameters for any data. It seems 
that the assumption of orthogonality of the acoustic waves is 
crucial in producing equations for which a tidy solution is 
possible. 


We have immediately 


w= a (V = vy) (3.18) 
and 
Bcosd = Ra - Pa , (3.19a) 
i = = 3.19b 
Bsinġ Ry Py 5 ( ) 


whence 8,$. Thus the terms which model the vorticity and 
entropy effects emerge simply and clearly. The next step is to 
solve for 9. If (3.17f) is subtracted from (3.17c), and 


(3.17a) is added to (3.17e), we find, 


Voie: 


tan2e = = - (3.20) 
cs 


This equation has an encouraging kinematic interpretation. 
Consider the fluid particles which at some instant form a small 
circle of radius £ around some arbitrary point. Relative to 
that point, their coordinates are 


{ecosy, esiny} . 


After a small time At, they have moved to new coordinates 


{ecosy + (u + ecospu, + esinyu,) At, 


esinp + (v + ecospv, + esinvv,) At}. 


Relative to the point (uAt,vdt), their new configuration is a 
curve whose polar equation is: 
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rêp) = eP + u At) cosy + u åtsiny)* + 


((1 + vyAt) siny + v,btcosh) "I R (3. 21) 


To first order in At, this is an ellipse:- 


Phe e Liee 2(u,cos*y + (u, + v,)sinycosy + v,sin“y) de] 
(3.22) 


The principal axes of this ellipse are found by setting 
ar/ay = O, which yields eqn (3.20). Therefore the acoustic 
waves produced by the model are aligned with the directions of 
maximum and minimum strain rates in the fluid. It is also 
relevant to compute what those extremal strain rates are. 
Inserting (3.20) into (3.22) gives 


2 2 
emis a Vy +ar) At] (3.23) 
where 
ep) 2 2 
= + - 
aR (vy a + (uy a) m (3.24) 


These expressions appear when we compute the amplitudes of 
the acoustic waves. Rewrite (3.17c,f) as: 


L 
a 


(a, - a) cos”8 + (03 - ag) sino = ' (3. 25a) 


(3. 25b) 


Li 
< 


@ - dy) sin? + (a3 - a4) cos76 
and solve for (a, - a.) , (a, - a). Thus 


U_cos“@ - v_sin“e 
Des eS aia 
i 

2 cos 6 - Eine 


ume: + cos20) - 4v_(1 - cos26) 
CRER Ls SOR eS a See 


cos26 
=4Lu, + va R] . (3. 26a) 


Similarly, 
Sau tg = aoe hv, R] 5 (3. 26b) 
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Compare these results with (3.23). To complete the analysis 


we find expressions for Oy + Gor a3 + agr using (3.17a,b): 
a + a, = P_cos6 + Pysind 5 (3.27a) 
a, +a, = Bycone - P Sin . (3.27b) 


The R.H.S. of these equations are simply the pressure 
gradients along the principal strain axes. All the parameters 
of the model flow therefore relate very simply to quantities of 
established significance in fluid dynamics. 


It is apparent from the construction of the model that if 
the flow were locally one-dimensional, comprising two acoustic 
waves and an entropy wave moving in the same direction, then 
that flow would be exactly reproduced. This is a big 
improvement over the operator-splitting model. However, a 
shear wave (3.9) is not correctly recognised. In the general 
case, with the wave inclined at an angle 6, the observed 
disturbance would consist of velocity gradients 


U. = -k sinécosé , 
x 
U = -k sins ' 
“f 
v =k coss ' 
% 
vV, =k sinécosé ' 
bi 


and inserting these into (3.18), (3.20), (3.24) and (3.26) 
would yield 


w=R=k , 
e@=6-7/4, 
r a Boy ae kL as 


The disturbance would therefore be seen as vorticity, together 
with four acoustic waves of equal strength, inclined at 45 to 
the direction of shear. It would be a little fanciful to 
associate this representation with the sound which is radiated 
from shear layers as a consequence of their instability. 
Nevertheless, the model just described has a far better chance 
of describing the flow realistically than has the operator-split 
model. 
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4, TWO-DIMENSIONAL EVOLUTION 
4.1 Preliminary remarks 


Just as the pattern-recognition half of an upwind algorithm 
requires new properties when deployed in more than one 
dimension, so the evolution part also raises new and unexpected 
problems. These can be illustrated in a setting even simpler 
than the linearised Euler equations. We consider the equation 
of uniform linear advection 


u, + au. +bu =0 (4.1) 
T x y 


where u is a scalar unknown, and a, b are constants. The one- 
dimensional version (b = 0) of (4.1) has been an unending 
source of inspiration for methods which can be avplied to more 
complex one-dimensional problems, but (4.1) itself has been 
less fruitful. In this section we explore some possible 
reasons. 


Note that we must restrict ourselves to solving (4.1) by 
methods which hold some promise of being extended to more 
general problems. For example, the operator-splitting 
technique is an excellent way of solving (4.1) (whose 
"matrices' a, b clearly commute), but we reject it because of 
the objections in Section 3. We also restrict ourselves to 
seeking only the simplest first-order scheme, analogous to that 
described at the end of Section 2. The supposition is that 
once a satisfactory first-order scheme is available, higher- 
order extensions can then be made. 


4.2 Strategies for linear advection 


Equation (2.9) is a measure of the lack of equilibrium 
within a one-dimensional interval [i, i + 1]. An analogous 
method within a two-dimensional '‘interval' [i, i+ 1] x 
[4, j + 1] (square cell ABCD) would be 


=A w B 
aai Zoe Ya * Me ~ Yo > Mp) ay a Ya o t MD) * 


(4.2) 


Tn a time At this requires the solution to change (somewhere) 
by Q iP 4 aptr and this change, for the linear advection 


equation (4,1) would be 
ôu = Qi jatt = AY) + vo) (a ~ a) + Ay = 5) (ay - up? 
(4.3) 
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where 
v= AAt/Ax , Vo = BAt/Ay . 
D A j+l 
Ay 
G Bj 
i i+l 


Now let this change be divided into four parts with weights 
a,8,y,6 where ao + B + y + 6 = 1, so that adu would be 


subtracted from Uy? Bôu from up’ etc. This seems to be a natural 


analogue of the upwind scheme described for one dimension in 
Section 2. We may imagine, for example, choosing (a,8,y,6) 
according to the signs of (vy) so as to create an upwind 


scheme. Now a property which enables a seheme to represent 
discontinuous solutions without creating spurious oscillations is 


+ 
monotonicity, defined as follows. Let uy ; be computed as some 
' 


function of various u values at time level n, i.e. 


untl 
i,j 


where each argument of the function H is i a for some (p,q). 
' 


= R(hy, hy, h, PATTA hy) (4.4) 


Then the scheme is monotone if, for all k, 


= SHON (4.5) 
k 


Theorem There is no set of constant weights (aB, y) which 
yields a monotone algorithm, consistent with the differential 
equation. 

n+l 
Proof In the general case with a,8,y,6 all non-zero, uj will 


depend on nine items of data. With notation as in the diagram, 


taking 
7 8 l ma 
ùj te es we have 
k=8 
6 2 wt oy ou (4.6) 
Oe k=O 


3 


72 ROE 


where the Ck are easily found to be 


1- ha(v, + Vo) + AB(vy = Vy) + ayi + Vy) + 46(vy = Vo) 


(>) 
w 


c = - hy (vy + v2) 

c, = ay (Va mS 46 (v) + Vo) 
c, = 46 5 = v) 
c, = ka (vo = v) + 46 (v) + Vo) 
i 
= ha (v + v2) 


c = halv = Vo) + aB (v) + Vo) 


c} = aB (v) - V) 


cg = - 8B(v) + V4) + ayi - ya). 


(4.7) 


To achieve monotonicity, we seek (a,8,y,6) such that for given 


(V1) none of the S% is negative. Now we observe that 


c, +c, +ep+cg=0, 
and if no c, are negative, this implies 
CoS Ce SCL sO, 
whence 
a = 
ae Goo aes ae CEEA 
1 2 2 x, 1 2 1 2 


But this woulā give 
a+B+ty+é=0, 
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whereas for consistency with the differential equation we 
require 


a+r Py HOaAD. 


Having failed to generalise to two dimensions the first 
of the one-dimensional upwind schemes mentioned in Section 2, 
we now attempt to generalise the ‘interface flux' approach. 
There seem to be two natural ways of doing this, and the first 
of these is illustrated on the left. At the centre of each 
cell, define average quantities 


up au, + Bug + yu, + Su, ; 
to = ou, + Buy + yu, + du, ' 
(4. 9) 
up = ous + Bug + yu, + ôu r 
Uy = aus + Bu, + Yug + Su, 2 


where again a,8,y,5 are arbitrary weights summing to unity. 
Now update Uy according to 


v v 
u+l u sg eee, a ls 
u = uae rA R S) patent Us uk ug) 
E v -y 
F iow) Be sd | ee 2 = 4L 
aeea uR 5 (up uy) (4.10) 
7 8 l 
W 
6 = 2 
Lá 
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Unfortunately this is not an independent approach. 
Substitution of (4.8) into (4.9) merely reproduces (4:4). 
Therefore, we have to attempt a third class of scheme, 
illustrated in the last diagram. Here we begin by assigning 
suitable mean values to the mid-points of the cell sides, 
W,X,Y,Z, thus: 


u, gu. 


i 1, + Bu, + yu, + du, + cu, + dug , 


u, = ou, + Buy + yu, + Su, + eus + ou, ' 


(4.11) 


[i 


u = au, + bug + cu, + du, + eu, + fu, ' 


u, aui +t bu + cu, + du, + eu, + fu. , 


Y 6 o 2 3 4 S 


werea+tB+ty+tô+te+tġ=at+tb+c+d+e+f=l. 


Then u5 is updated by the formula 


n+l n_ 
Uy UF - vy (uy - a) -= Vola, - uy) ° (4. 13) 


This approach will again lead to a nine-point scheme of the 
form (4.4) but 


a 
i] 


1 - (B + €)v, - (b + e)v, , 


fe} 

c =~ av T cv, ' 

Co = - Bv] + (c - avy , 

c3 =- YY) + av, ’ 

c4 = (y - §) vy + ev, 3 (4. 14) 
c = $v, + fv, * 

Cg = eV + (a- £) va r 

cy = ov, = avo 1 

cg = (a -= ov - bv 


2 . 


This approach does yield monotone schemes, and in great 


variety. For example, if it is given that Viya are both 


positive, then we can easily ensure c > O for all odd k by 


k 
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taking (a,y,a,c) all negative, and (6,$,d,f) all positive. 
Positivity of So is assured for Vaya sufficiently small, and 


of the remaining even coefficients by choosing the last four 
parameters to satisfy 


2 


By s (¢- av, so , 
ev) > (f£- a) v, $ Ohi 
bv, $ (a - LAST so , 
ev, > (6 - Yy E EIMAN 


The simplest example of a monotone scheme in this class 
would be the straightforward upwind scheme defined by € = e = 1, 
with all other parameters zero, but it would obviously be 
possible to find many others. 


Evidently we may create similar schemes based on non- 
quadrilateral meshes. The plane could be divided into 
arbitrary polygons, and values of u could be assigned to their 
vertices. In this situation we can devise algorithms 
corresponding to each of the above types. 


Type A Evaluate a fluctuation over each polygon and send 
signals to each vertex of that polygon. 


Type B Evaluate an average state at the centre of each 
polygon, and update each vertex by computing the fluctuation 
over the centres of the polygons which meet there. 


Type C Evaluate an average state at the centre of each edge, 
and update each vertex by computing the fluctuation over the 
centres of the edges which meet there. 


Although we have no proof, we conjecture that there are 
never any monotone schemes of Types A or B. We have not found 
any for triangular or hexagonal meshes. However, we can 
exhibit a monotone algorithm of Type C for a fairly arbitrary 
mesh, (see Fig. 3). 
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Fig. 3 Part of a general tessellation, and its dual. 


Inside each polygon construct an interior point. Join these 
up to form a dual mesh so that each vertex of the original mesh 
lies inside a cell of the dual mesh. (If this cannot be done, 
the mesh was probably not sensible anyway.) Assume u = constant 
inside each of the dual cells. Then the flux across the 
boundary between cells A and B is 


Qas = (ap aU, or” (Eaa (4,15) 
where Zag is the vector representing the interface MN between 
A and B, a is the advection velocity (a,b) and we select uy if 
a is directed out of A, otherwise Ua: Now let 2Q,y be the sum 


of all such terms over the edges of the dual cell surrounding 
A, whose area is Sa: Then 


ðu, 

A 

Zoar T Sa Oe 

defines a semi-discrete evolution scheme for advancing the 


solution in time. If we make a sufficiently small finite 
time-step in the simplest way 


(4.16) 


n+l TA At 7) 
uy ua s, vio (4,17) 
it is clear that the new value of Uy will be some average, with 


positive weights summing to unity, of the old values of u at A 
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and certain neighbouring cells. The scheme is therefore 
monotone, and hence stable. Now that we have it, the 
assumption of piecewise constant states may be forgotten. It 
was merely a device to derive the scheme. 


To effect a connection with conservation laws, we write 
QB = (Zape8) Uap 

where u, i 
AB is one of u up’ and express Up in a form that can 


be generalised to yield an analogue of (2.16). We write 
= + 4 i = 
Uap (uy u,) ‘sgn (rpa) (u, T u,) (4.18) 


and note that 


wry = Spp’ 9rad u (4.19 
where SaB is the vector representing AB. Thus 
= + - $ . . 
Yap alua up? 4sgn (rag a) (Sag grad u) (4.20) 


which closely resembles (2.14), and would reduce to it for a 
one-dimensional case. 


In general, we will not know grad u except in so far as we 
can deduce it from the values given at the vertices. Unless 
the variation of u over each polygon is exactly linear, the use 
of an estimated građient will leave (4.19, 4.20) only 
approximately satisfied. In consequence, the scheme based on 
(4.20) is not monotone on general grids. However, it is 
monotone on triangular grids, because u can then be assumed 
linear over each cell without inconsistency. 


4.3 Application to systems of equations 


The ingredients needed to construct a genuinely two- 
dimensional monotone upwind scheme for the linearised Euler 
equations can now be assembled, see (Fig. 4). Construct a 
triangular mesh with flow states w given at its vertices. 
Within each triangle compute the gradients Myr wy Using the 


results from Section 3.3 find the local flow model corresponding 
to these gradients. Now find the dual mesh as follows. Within 
each triangle join the mid points of the sides to the centroid 
of the vertices, creating three equal sub-areas. This 
particular interior point is chosen because we can imagine the 
same integrated quantity of w either linearly distributed or 
constant in each sub-area. We then associate with each edge 
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of the mesh the state 


a %7 (k) (k), _ (k) 
Map ™ HOH, + My) ~ MPson(z a (S,p-2 JE 
(4.21) 
where a is, as before, the vector describing propagation 


speed, and ew) is the vector (a ©? cose ale att sind (k) ) in 
which a = wave strength, 8 = wave orientation, as in Section 3. 


Finally 2) is the eigenvector giving the distribution of the 


(Ke) (k) 


wave's effects over w, so that the product e represents 


the flow gradients due to the «oh wave. The summation is over 
all waves present in either of the cells adjacent to AB. The 
factor 4 appears in (4.21) rather than } in (4.20) because in 
regions of uniform gradient there will be contributions from 
both cells. In general the boundary of the dual cell will 
change direction as it crosses AB; this does not affect the 
legitimacy of (4.20), since the fluxes across all curves with 
the same endpoints are equal. 


Fig. 4 Part of a triangular tessellation, and its dual 
To apply these ideas to a set of conservation laws, 


Us ou Ea SAS (4,22) 
the formula (4.21) can be modified as follows. We now need 
to associate with each edge AB a value of F and a value of G. 
The simplest way to do this is just to take 


PERRE ep eG Sh i 


Bee Wan) ras ye. (4.23) 


Yag 
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Then the total flux through XY is 


rE (ays) iene (XK Bem.) (4.24) 
=ap'¥m ~ Yy’ ~ Sap yw 7 N 


and the formula to update Wy is 
uM PSAs 
My ta Tg, ax * cel 


This formula defines a scheme which is conservative in the 
traditional sense, monotone in the linear scalar case, and 
strictly upwind in any region of simple wave flow. 


5. CONCLUDING REMARKS 


The purpose of this paper has been to provide a critique of 
the way that upwind- differencing is usually applied to two- 
dimensional problems. It has been shown that the operator- 
splitting method can produce misleading results. An alternative 
proposal is made to base the algorithm on a local flow model 
which attempts to recognize flow patterns associated with two- 
dimensional simple waves. These patterns are strongly linked 
to familiar analysis concerning the local kinematics of the 
flow. 


Once the nature of the local flow is established, we wish 
to exploit it in an evolution algorithm, creating to begin with 
a monotone first-order scheme, as a foundation for future. 
work. Through exploring the simplest linear advection 
equation, it transpires that the possibilities are quite 
limited. The author feels that the formula presented in 
Section 4.2 may well represent the most natural way to create 
a two-dimensional upwind algorithm. 
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METHODS FOR THE PREDICTION OF “HYPERBOLIC FLOWS" 
BASED ON PHYSICAL ARGUMENTS 


M. Pandolfi 
(Dipartimento di Ingegneria Aeronautica e Spaziale, 
Politecnico di Torino, Italy) 


l. INTRODUCTION 


The "hyperbolic flows" we are considering here are the 
unsteady and steady supersonic flows of a compressible inviscid 
gas. 


Investigations on unsteady flows are required in order to 
provide a physical description of actual transients as well to 
give steady transonic flow configurations achieved through 
"time-dependent" or “pseudo time-dependent" techniques. The 
"hyperbolic" coordinate is represented by the time and the 
solution is obtained by marching along this coordinate. 


In supersonic regions the steady flow problem also has a 
"hyperbolic" character, This fact enables us to obtain a 
solution of the flow field by marching along a space coordinate, 
and the description of the flow evolution will then be 
provided in this direction. This space coordinate is then the 
“hyperbolic” one. 


Even if trivial, it should be remembered that 2D unsteady 
flow is a phenomenon in which we consider two dimensions in 
space and look for the evolution in time, whereas 3D steady 
supersonic flow is still a 2D "hyperbolic" problem in the 
sense that we look for a description of the flow field over a 
two-dimensional plane and aim to give the evolution in space 
by proceeding in the third space dimension. So, as regards 
the "hyperbolic" nature of the problem, 2D unsteady and 3D 
steady supersonic flows are equivalent. 


In the following we will emphasise unsteady flows and say 
only a few words on steady supersonic flows just at the end. 
However almost everything that will be said for the former 
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flows can be easily translated into the domain of the 
latter. 


"Hyperbolic" flows are characterized by the propagation of 
waves, as is well known. The numerical procedures developed 
on the basis of the method of characteristics aim to follow 
and describe explicitly these waves during their propagation. 
While these procedures are very accurate and consistent with 
the physics in the case of 1D "hyperbolic" problems (that is 
1D unsteady and 2D steady supersonic flows), they become less 
appealing in multidimensional flows for topological reasons. 


In the last decades numerical procedures have been 
developed which are labelled as "finite-difference" (or finite- 
volume, or finite element) approaches to the approximation of 
the original equations on the basis of a discrete number of 
points where the flow properties are represented. Most of 
these procedures have more or less neglected the conceptual 
effort made in the past to describe the propagation of waves 
and attention has been focused mainly on the development of 
numerical aspects. By doing this, the physics has quite often 
been ignored. 


Somewhat different approaches were proposed many years ago 
(Courant et. al. (1952), Gordon (1968) and Godunov (1959)). 
They attempted to include the concept of wave propagation into 
finite-difference approximations. More recently this last 
kind of investigation has again received attention and many 
such approaches are proposed today for the prediction of 
unsteady flows. They are generally known as "upwind" methods. 


Their common feature is the way they are conceived. First 
the Euler equations are analysed emphasizing the 
propagation of waves. The important role of the domains of 
dependence is recognised together with the nature of the 
convection of signals through the flow field. Furthermore, 
important suggestions are made for avoiding the introduction of 
artificial conditions at the boundaries. Moreover it follows 
that some of the variables appearing in the original equations 
are not the most significant and that new variables express 
better the mechanism of the convection of signals. We will 
call this preliminary step of investigation the "formulation". 


Once the formulation has been worked out we can proceed into 
the numerics by approximating derivatives with finite 
differences. But, whatever the numerical algorithm might be 
(first, second or higher order of accuracy, explicit or 
implicit procedures, fast or accelerating solvers, multigrid 
methods and so on), it should preserve and retain the features 
emphasized in the previous formulation, namely the character 
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of the convection of signals: in other words the "upwind" 
idea should be incorporated into the numerical approximation. 


In the following we will consider two approaches belonging 
to this class of methodologies. We have been working on both 
for some years and found them rewarding. 


The first is generally called the "lambda" scheme. Here we 
will call it rather the "lambda" formulation, because its most 
important feature is the preliminary work done on the original 
equations (formulation). After that any more or less 
efficient numerical scheme may be applied. 


The second approach may be called the "flux-difference 
splitting" (hereafter referred as F.D.S.) formulation: here 
the work done in the formulation is mainly related to the task 
of solving Riemann problems (R.P.) in order to recognize the 
propagation of signals. Even here different numerical schemes 
may be introduced as regard the numerics. 


At first glance the difference between these two approaches 
seems to be considerable. Nevertheless they are strongly 
related to each other. For example, in the case of the 
linearized version of the Euler equations it turns out that the 
two coincide exactly from the conceptual and formal points of 
view. 


In fact the main difference between the two approaches lies 
in the original equations on which the formulation is based. 
The first (lambda) is founded on the differential equations 
written in the so called non-conservative form and in terms of 
the primitive variables (the gas velocity and two thermodynamic 
properties, for example the speed of sound and the entropy). 
The numerical solutions are then confined to the field of 
smooth solutions. The second (F.D.S.) is based on the 
conservation form of the equations and provides a numerical 
approximation of the weak solution of the laws of conservation 
expressed by the Euler equations. For this reason the two 
approaches differ with respect to the strategies for 
predicting gasdynamical discontinuities such as shock waves. 


In the following the presentation will be organized thus: 


§ 2. The "lambda" formulation in the 1D problem. 
§ 3, The "F.D.S." formulation in the 1D problem. 
§ 4, The multidimensional problem. 

§ 5. A suggestion for a hybrid formulation. 

§ 6. The extension to supersonic steady flows. 
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2. THE "LAMBDA" FORMULATION IN THE 1D PROBLEM 


The Euler equations written in non-conservative form are: 


Yok 2 
a, + ua, toy au, = 0 
2 af 
= = 2e L 
Ma AES ARET aa, yty=L) Sk o ( ) 
Ss, +uS so 


where the primative variables (speed of sound, gas velocity 
and entropy) are denoted by a, u, and S. 


By rearranging equations (2.1), two new variables may be 
defined, which replace the primitives ones a and u: 


aR, = a a-u) - ey as 
(2.2) 
ar, = ai ato ane 
Therefore equations (2.1) become: 
Rit + (u-a) Ry, =o 
Rat + (uta) Roy =0 (2.3) 


These equations now reveal the wavelike nature of the problem. 
In fact they describe the convection of signals. 


The first two of the equations (2.3) refer to the 
Propagation of "acoustic" waves. The corresponding signals 
are defined in equation (2.2) and coincide with the Riemann 
invariants in the case of homoentropic flow (sy = 0). For the 


less restrictive assumption of isentropic flow (S + us. = 0), 


it is no longer possible to define the classical Riemann 
invariants and the "acoustic" signals can only be defined in a 
differential form (equation (2.2). Of course the entropy 
gradients can only originate at the incoming flow boundary and 
not from the generation of entropy through shocks, equations 
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(2.1) being written in non-conservative form. The third of 
equations (2.3) represents the convection of the entropy wave. 


As the result of the definition in equations (2.2), the set 
of equations (2.3) can be finally written as 


w 
L 


Lyo | a 
Pig [ (a aR + (uta) R, J + s 


t OMe E 

u = -+[-(u-aR,. + (utaR, ] (2.4) 
t 2 1x 3x ' 
Ss, =-u Sk 


On the basis of equations (2.4), we can now describe the 
evolution in time of the flow after the evaluation of the space 


derivatives R., R,, and S, . These have to be regarded as 
1x 3x x 


one-sided derivatives, the side being determined by the sign of 
the relative velocity of propagation of the signals 
(respectively u-a, uta and u). 


All the above considerations represent what we have called 
previously the "lambda" formulation. After an initial 
tentative suggestion by Pandolfi and Zannetti (1978), the 
basics of this formulation have been well laid down by Moretti 
(1979). 


We can now proceed to the numerics. The space derivatives 
will be approximated by finite differences along the space 
coordinate x with one-sided differences, in order to retain in 
the algorithm the physical description of the propagating 
signals. Algorithms of any required order of accuracy can be 
built up. They may be explicit or implicit. In the present 
case we use the simple first order "upwind" scheme, an 
algorithm equivalent to the basic version of the method of the 
characteristics. 


It will be shown by Zannetti (1985), in the case of 
homoentropic flow, that equations (2.4) satisfy also the jump 
relationships of space derivatives across characteristics. 
Therefore the "lambda" formulation agrees with the double 
physical interpretation of the characteristics as lines where 
the signals propagate and where jump relationships of 
derivatives are prescribed. 


A further important feature of the "lambda" formulation 
emerges when we deal with the boundaries. It can be seen that 
whenever a space difference is required behind a boundary (which 
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therefore is not available), a physical boundary condition will 
take its place. So no artificial or additional boundary 
conditions will be introduced. The "post-correction" 
technique, proposed by De Neef and Moretti (1980), should be 
considered as the basic contribution to this important point, 
even in connection with different formulations or procedures. 


As can be argued by looking at equations (2.4) the number 
of operations involved in a numerical algorithm based on the 
"lambda" formulation is quite small and this contributes 
greatly to confining the required computational time to very 
low levels. Furthermore the accuracy of the numerical results, 
for a given numerical scheme and computational grid, turns out 
to be very satisfactory, mainly because we have selected as 
dependent variables the most significant ones. Therefore the 
computational grid can be kept coarse and the computational 
time further reduced. 

However, the solution of equations (2.4) being confined to 
smooth flows, we cannot deal correctly with discontinuities, 
such as shock waves, other than working out some explicit shock 
treatment, such as shock fitting or tracking techniques. This 
is needed to generate the entropy jump, not provided by the 
original equations. 


3. THE "F.D.S" FORMULATION IN THE 1D PROBLEM 


The Euler equations are now written in the conservation form 
related to the laws of conservation 


È = s 
We Fy o (3.1) 
where: 
Pp pu 
2 
W = jpu and F = |-p + pu 
e u(pte) 


Here p, p, and e denote respectively the density, the pressure 
and the total internal energy per unit volume. 


Let us consider the flow configuration at the time t. The 
flow properties being given at a discrete number of points, we 
approximate the distribution of any variable along x with the 
value at the point N kept constant on the two half intervals 
in front and behind it. The picture of this representation at 
the time t, over the interval bounded by the points N and N+l, 
is shown in Fig. l. The discontinuity located at the middle of 
the interval will then evolve in time, on the basis of the 
initial conditions prescribed in the regions a and b 
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(corresponding to the points N and N+l). Three waves (1,2,3) 
will be generated and the new regions c and d will appear. 
Waves (1,3) correspond to the "acoustic" waves and wave (2) is 
the entropy wave. 


Fig. 1 Solution of the Riemann Problem 


The description of this evolution is provided by the 
solution of a Riemann problem (R.P.), which can be obtained 
exactly following classical procedures. So the pattern of the 
waves (1,2,3) and the flow in the regions c and d can be 
evaluated. At this stage we assume that the unsteadiness 
caused by the development of the discontinuity will affect 
the evolution in time of the flow at the computational points, 
as follows. Those signals which are related to the waves 
propagating rightwards will contribute to the evolution in 
time at points located on the right hand side of the interval, 
and the corresponding will be done for the signals related to 
the waves travelling to the left. The signals are determined 
by the difference of the flow in the regions a, b, cC, and d. 


The formulation will be then defined as it follows. Any 
difference of the flux over an interval Dx: 


ENT ave MAL TN 
will be divided into three terms: 


DF = E Fa F (Fa Fo + (Fo Fa (3.2) 
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Each term is a signal and corresponds to one of the waves 
(1,2,3). Then we split the difference of the flux D_F in two 


N 
parts: 
DF=DF+DF 
NON N 


which will contribute to the time evolution at points located 
respectively on the left and right hand sides of the interval. 
The first part will retain the terms of equations (3.2) which 
are related to waves propagating leftwards, whereas the second 
will account for those related to waves travelling rightwards. 


At this stage the formulation is completed and we may 
proceed to the numerics. For example, with the first order 
scheme, the dependent variables at the point N and the time 
t+Dt are evaluated as it follows. First the R.P. is solved in 
the two intervals bracketing the point N. Then we have: 


— 


z LiDE 
Wy(t + Dt) = Wit) - Sa OnE + DF) (3.3) 


More accurate schemes can be built up, always based on the 
—_ 
"upwind" idea for the split terms DF and DF. 


The exact solution of the R.P. (Fig. l) can be generally 
obtained only at the price of tedious and long computations, 
because of the iterations required when one or both of the 
waves (1,3) are shocks. It will then be convenient to look 
for approximate solutions of the R.P. in order to save 
computational time. Two different procedures have been 
proposed. According to the suggestion made by Roe (1981) it 
is convenient to replace the original non-linear R.P. by a 
linear one with certain appealing features (the solution will 
be exact if the wave pattern exhibits a single shock wave). 
The solution is then obtained quickly. On the other hand 
following the procedure indicated by Osher and Solomon (1982), 
we assume that the "acoustic" waves (1,3) are isentropic (even 
if in fact one or both can actually be shocks). Under this 
assumption the solution is worked out easily in a classical 
way. A third approximated solution (in fact closely related 
to the one proposed by Osher and Solomon (1982) and physically 
similar to the one suggested by Roe (1981)) was given later 
by Pandolfi (1984 a), who presented also some remarks on the 
previous two. In a few words it is conceived as follows. 


As suggested by Osher and Solomon (1982) we assume as 
isentropic the "acoustic" waves (1,3). The flow properties 
being defined in the regions a and b (speed of sound, gas 
velocity and entropy) we look for the same variables in the new 
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regions c and d. It follows that the six equations given below 
provide the solution in a simple fashion. 


Conservation of the Riemann invariant R, and of the entropy 


across the wave 1: 3 
Bees +u_= z2 a_ + 7; S_ wg, 
y-1 “c E AE unr cian 


Conditions on the contact surface (that is across the wave 
2): 
c asin iPo Pati Ss. Se ay SP, 2y 


Conservation of the Riemann invariant R and of the entropy 
across the wave 3: 


DGS a PORT SAT d jag 
aT aE a Sg ra KA. N ye) 


If the waves (1,3) are expansion or compression isentropic 
fans, it may occur that one characteristic imbedded in the fan 
is vertical, denoting a sonic transition. In this case a 
further splitting is done on this wave, in order to determine 
which part of it is propagating to the Eft and which to the 
right (see Pandolfi (1984 a)). 


We have performed several numerical experiments by operating 
the splitting according the the exact solution and the three 
above mentioned approximate solutions of the R.P. and have 
found the difference ig the numerical results quite negligible. 
Therefore, as also anticipated by Roe (1981), we can avoid the 
exact solution and apply any approximate procedure without 
penalty in the accuracy of the results. 


The numerics based on the "F.D.S." formulation provide good 
capability of capturing shocks, which come out very sharp and 
neat, For steady configurations shock waves are always 
described in two or three intervals, even with the ordinary 
first order scheme. Second order schemes give better sharpness 
to the shocks, but some flux-limiters or monotonicity criteria 
must be included to avoid spurious oscillations in the 
neighbourhood of shock waves. 


However, on the basis of our experience, the overall level of 
accuracy is not remarkable, at least compared with that 
achievable by the "lambda" formulation. This result could be 
ascribed to the fact that in the "F.D.S." formulation the 
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convection of the signals is hidden in the split parts of the 
difference of the flux and not treated explicitly. 


Furthermore there are several operations involved in the 
preliminary work of carrying out the splitting on each interval 
and the computing time is penalized. 


4, THE MULTIDIMENSIONAL PROBLEM 


We have been so far how to deal with the 1D problem, where 
the analysis is quite clear and without any ambiguity. We now 
try to go from the 1D case to the multidimensional cases. 


The diagrams in Fig. 2 show how the picture changes in the 
transition to multidimensional problems. We aim to describe 
the evolution at the point P from time t to t+Dt. In the 1D 
problem (Fig. 2a) the variation of the flow is determined by 
the merging at P(t+Dt) of the three characteristics (carrying 
the two "acoustic" signals and the entropy) leaving three 
definite locations at the time level t. In the 2D case (Fig. 
2b),, the entropy is still carried on a well defined ray (the 
path of the particle reaching P(t+Dt)), but there are an 
infinity of "acoustic" signals merging at P(t+Dt), which 
propagate on the straight lines which define the Mach cone. 

All these signals leave the plane (x,y) at the time t froma 
circle with radius equal to the speed of sound a. Going now to 
the 3D case (Fig. 3c) the picture cannot anymore be represented 
on the paper and we shall imagine the signals leaving the space 
(x,y,z) at the time t from a sphere (with radius a) and merging 
at the point P(t+Dt) along rays propagating in a fourth 
dimension (the time). 


P(t+Dt) 


3 POI 
pe x 
a a 


Fig. 2a Evolution in time for the 1D unsteady flow. 
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Fig. 2b Evolution in time for the 2D unsteady flow. 


Fig. 2c Evolution in time for the 3D unsteady flow. 


4.1 The "Lambda" Formulation in Multidimensional Problems 


Let us consider the non-conservative form of the Euler 
equations in a cartesian frame of reference. They are: 


+ q.Va =e i= a Veg =0 


ae 
a. + (q-Va + av EET (4.1.1) 
Be SESE eo a] om ae 
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where now q represents the velocity of the gas in the form 


> > > > 
q = ui + vj + wk 


Let us now define the general unit vector ig normal to the 
sphere of Fig. 2c and defining the plane u normal to ae The 
vero ctky may be decomposed into the components Ig along è and 
4, lying on the plane u, namely 

> > > 
tag 
According to the 1D analysis we define now a new variable, 


ae 
related to the vector € by 


aCe ELAES 
aR, = Mee at ag) ¥(y=1) 


Then, from equations (4.1.1) we obtain a compatibility 
equation over the characteristic ray leaving the sphere from 


Pi 
the point where we have selected the vector €, namely 


> > 2 
Ree + (q+ ab) LWT a+ ap) = PEST vs]= - aV. 4, (4.1.2 


Looking at the equation (4.1.2), we realize that it is no 
longer possible to define the Riemann invariants, even for the 
homoentropic flow (VS = 0), because of the source term on the 
right hand side. Rather we may define Riemann variables or 
signals. These will change their intensity while travelling 
along their relative rays according to the intensity of the 
source term, which accounts for the multidimensionality of the 


flow. It is also clear that the choice of the vector = will 
provide a compatibility equation related to the projection of 


the momentum equation in the direction of is 


By following the line taken by Zannetti and Colasurdo (1981) 
> > 
we select for the vector € six values, corresponding to +i, 


> 
+3, and + k. This choice takes into account the equilibrium 
between the pressure gradient and the inertial forces in the 
directions of the three coordinates. We then have 
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AE 
a 3 7 Lorary, + (utayR,, 


+(v-a)R + (vta)R, 
i JY 


jy 


a 
+ (w-a)R + (wła)R l + ay 8 


t 
Usa La(uca RU Ree ue owe 
t 2 -ix -iy -iz 
+ 
(uta) R y + Riy. + WR! (4.1.3) 
v = ak [-uR - (v-a)R - wR 
t 2 -jx TIY: -jz 
+ uR, + (v+a)R, + wR, ] 
Jx Jy Jz 
w=- ds [-uR - vR - (w-a)R 
t 2 -kx -ky -kz 
+ UR + ve + (wta) R J] 
aa e T 


The set of equations (4.1.3) provides the description of the 
evolution. in time of the flow on the basis of space derivatives. 
As in the 1D case, they have to be regarded as one-sided 
derivatives, the side being determined by the sign of their 
coefficient. 


A very large number of numerical experiments have been 
carried out in the past with numerical schemes of second order 
accuracy for 2D and 3D flows. In particular we refer to 
Pandolfi and Colasurdo (1980) as far as 3D flows are concerned 
and point out the capability of predicting correctly non- 
trivial secondary flows, where the role of the rotationality is 
predominant. The ability of the "lambda" formulation in 
multidimensional flows to agree with the physical interpretation 
of the characteristic surfaces will be discussed by Zannetti 


(1985). 
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4.2 The "F.D.S." Formulation in Multidimensional Problems 


The Euler equations are now considered in conservative form 
for the 3D flows described in a cartesian frame of reference, 
i.e. 


VERES EG be Bi Q, 
t x y z 


where 

p pu pv pw 

pu ptu puv puw 
W= |jpv|; F= |puv ji G= pov 7 H= |pvw 

pw puw pvw pt ana 

e u(pte) v(pte) w(-pte) 


Following what has been done for the 1D case, we shall now 
proceed to the splitting of the difference of F,G and H over 
the intervals Dx, Dy and Dz. This can be done by solving 
Riemann problems on each interval. 


For a Dx interval the R.P. will refer to a discontinuity of 
all the variables (a,u,v,w,S) and will be solved in a 1D 
fashion, by taking the velocity components normal to the x-axis 
(v,w) unchanged through the "acoustic" waves (u-a and uta), just 
as is done for the entropy. ‘he approximated solution worked 
out for the 1D case can be easily transfered here (see Pandolfi 
(1984 ‘b)). On the basis of the solution of this particular 
R.P. we can proceed to the splitting of the difference of the 
flux P over the Dx intervals. 


The same procedure will be followed for splitting the 
differences of the flux G and H, respectively over the 
intervals Dy and Dz. 


It' can be easily recognized that such a procedure makes the 
"F.D.S." formulation coincide with the "lambda" formulation, if 
the linearized version of the Euler equations is used. 


Quite often some transformation of coordinates is needed, 
which will generate a non-orthogonal grid in the physical 
region. In this case R.P.s have to be solved over a given 
interval if we desire to reproduce in the "F.D.S." formulation 
the features of the "lambda" formulation. In general it can 
be seen that each test on the sign of a given component of the 
velocity of a signal propagating on a characteristic ray, in 
order to determine the "upwind" side for the relative 
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derivative, will require, as counterpart in the "F.D.S.", the 
solution of a corresponding R.P. approach. The reader may 
refer for this point to Pandolfi (1984’b). 


5. A SUGGESTION FOR A HYBRID FORMULATION 


From what has been said on the "lambda" and the "F.D.S." 
formulations it may be concluded that the first presents many 
advantages, such as precise description of the physical 
phenomenon, a natural way of imposing the boundary conditions, 
accuracy of the numerical results, simplicity in the coding 
and low computational times. Unfortunately it does not give 
correct capturing of discontinuities. On the other hand the 
"F.D.S." formulation seems to be particularly suitable for 
providing good capturing of discontinuities. Furthermore the 
two formulations belong to the same family of approaches and 
coincide for linear problems. 


From our past experience in both approaches it turns out 
to be natural to think about a hybrid formulation, capable 
of picking up the positive features of the previous two. 


The hybrid formulation is conceived as follows. At the 
beginning of each integration step we set at all the 
computational points an index IFORM=O. Then a sweeping is 
done over the whole computational region to locate in which 
intervals a sonic transition from the supersonic to the 
subsonic regime occurs. At the four points bracketing these 
intervals we redefine the index IFORM=1. 


Fig. 3 Computational grid (15 x 40 intervals). 
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Fig. 5 Detail of the shock at the wall (Mach number contours). 


Then we proceed with the computation. At those points flagged 
with IFORM=O we apply a second order scheme based on the 
"lambda" formulation, and where IFORM=1 we adopt a numerical 
scheme based on the "F.D.S". formulation. In computing the 
results shown hereafter we have used simple first order scheme 
for the "F.D.S". Perhaps a second order scheme could be an 
advantage in certain flows. This seems to be the case for the 
transonic flow over airfoils, according to some preliminary 
results we have achieved so far. 


nr r 
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We show here some numerical results obtained on the basis 
of the hybrid formulation. They refer to the test case 
proposed by Foerster (1978), namely the internal flow through a 
parallel channel with a circular bump on the lower wall. The 
grid we used is shown in Fig. 3. It has 15x40 intervals, 
equally spaced in the physical space. It is a very primitive 
grid and it may be modified easily by adding simple stretching 
both in x (to move the inlet and exit boundaries farther from 
the bump) and in y (to cluster more points near the bump). 


Fig. 6 Entropy contours. 


Fig. 7 Detail of the shock at the wall (Entropy contours). 
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The distribution of the Mach number is shown in Fig. 4. A 
detailed picture is given in Fig. 5, to show the structure of 
the captured shock. The isolevel lines of the entropy are 
given in Fig. 6 and the corresponding detailed picture on 
Fig. 7 shows the description of the rotational flow behind the 
shock. Finally Table 1 gives the value of the index IFORM, 
at the computational points. Only 24 points are computed with 
the time consuming "F.D.S." formulation, whereas the others are 
predicted with the efficient "lambda" formulation. 
Other ways can be found for merging the positive features of 
the non-conservative and the conservative formulations. A very 
interesting suggestion and promising preliminary results have 
been recently presented by Dacone and Yagi (1985). 
s=====ITFORM(N,M) (O=LAMBDA 1=F.D.S.)=== 
0900000000000000000000000000000000000000 
0900000000000000000000000000000000000000 
00000000060000000000000000000000000000000 
9000000000000000000000000000000000000000 
0000000000000000000000000000000000000000 
9900000000000000000000000000000000000000 
9900000000000000000000000000000000000000 
0000000000000000000000000000000000000000 
0000000000000000000000000000000000000000 
0000000000000000000000000000000000000000 
0900000000000000000000001111000000000000 
2000000000000000000000000011110000000000 
90000000000000000000000000011 11000000000 
0000000000000000000000000001 111000000000 
09000000000000000000000000001 11100000000 


00000000000000000000000000001 11100000000 


Table 1 
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6. THE EXTENSION TO THE STEADY SUPERSONIC FLOWS 


As mentioned in the introduction, almost everything 
discussed so far for the unsteady flow can be easily 
transfered to the supersonic steady flow. 


The extension of the "lambda" formulation to supersonic 
problems was proposed many years ago by Moretti (1979) and 
after that several codes were developed for practical purposes 
with very good results (for instance see Moretti (1982) and 
Marconi (1984)). 


In the field of the "F.D.S." formulation the practical 
extension to supersonic flows also appears quite easy. The 
only additional problem comes from the fact that in the 
approximate solution of the R.P. the simple form of the 
classical Riemann invariants is replaced by the Prandtl-Meyer 
relationship. However a linearization of this formula makes 
the solution of the R.P. as easy as for the unsteady flow, 
without any penalty in the final numerical results. 


A "F.D.S." formulation and relative second order schemes are 
presented in a detailed form with numerical results by 
Pandolfi (1985) in the simple case of 2D steady supersonic 
flows. Work is under way for extending the procedure to 3D 
flows. 


7. CONCLUSIONS 


"Hyperbolic flows" can be investigated by numerical 
techniques aiming to describe the physical feature of the 
propagation of waves. Two steps are required in defining these 
methodologies. First the formulation, in which the original 
equations are rearranged in order to emphasize the role of the 
propagation of signals on characteristic rays. Then a 
numerical scheme is applied which should retain in the 
numerical approximation the "upwind" nature recognized in the 
previous formulation. 


Two approaches have been considered here. The "lambda" and 
the "flux-difference splitting" formulations. The first is 
very appealing from all points of view except for the 
capability of correct shock capturing. Therefore some shock 
fitting procedure is required when this discontinuity appears. 
The second provides the numerical approximation of the weak 
solution of the laws of conservation and a very sharp and neat 
description of shock waves. However it does not perform, as 
far as accuracy and computational time are concerned, at the 
best level. 


100 PANDOLFI 


A "hybrid" formulation is proposed to pick up the positive 
features of the two previous approaches. It requires the 
"F.D.S." procedure just where a "strong" shock is detected, 
leaving to the "lambda" procedure the responsibility in the 
rest of the flow field. The preliminary results achieved in a 
classical test case for internal flows are quite promising and 
the approach looks efficient as regard accuracy and 
computational time. In particular no spurious sources of 
entropy are generated, as often happens when working only with 
the equations written in the conservative form. 
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SPLITTING UP TECHNIQUES FOR COMPUTATIONS 
OF INDUSTRIAL FLOWS 


J.P. Benqué, J. Cahouet, J. Goussebaile and A. Hauguel, 
(Laboratoire National d'Hydraulique, E.D.F., Chatou - France) 


1. INTRODUCTION 


The purpose of the numerical development at the 
<< Laboratoire National d'hydraulique >> during the last ten 
years is the realistic simulation of industrial flows 
(Benqué et al., 1983, Grégoire et al., 1984, 1985); in practice 
this purpose implies the treatment of complex equations with 
several variables discretised over a large number of nodes. 
The trend is now to develop finite element codes given that the 
use of finite elements allows a natural treatment of complex 
geometry with easy local refinements. Moreover the use of the 
splitting up techniques seems the best way in order to offset 
the increase in computational requirements and also to secure 
the most effective solvers: actually nearly all numerical 
models developed in our laboratory during the decade are based 
on the simple idea that an algorithm is efficient when it has 
a single objective. Progress has been achieved over this 
decade regarding the precision on linking up solutions. Simple 
fractional steps were employed previously and practically 
transparent formulations are in use today. As regards 
industrial computations, it is convenient to draw most benefit 
from the regularity of the grid (as in finite differences) 
while preserving an accurate description of the boundaries (as 
in finite elements); an attractive solution then appears in the 
use of geometrical substructures with specific solutions. The 
use of splitting up techniques in the finite elements context, 
firstly developed for the Navier-Stokes equations, has been 
recently applied for more complex models such as k - e model for 
turbulent flow. This article is a review of the basic 
principles and their most recent applications concerning such 
various fields as domain decomposition or turbulence modelling. 
In section 4, these developments are illustrated by some 
industrial computations in 2-D or 3-D. 
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2. SPLITTING-UP OF OPERATORS - RONAT ALGORITHM (Benqué and 
Ronat, 1981) 


The method of fractional steps was used very early because 
of the initial choice made in our laboratory: transient 
equations for proper processing of non-linearity; implicit 
methods (or to be more precise, no time-step limitation 
required). 


The principle is quite simple; for instance to solve the 
classical transport diffusion equation: 


ranah E E o = E e = 0. (2.1) 


The basic idea is to make advective terms explicit by using 
an upwind scheme as physical as possible, and to build an 
accurate scheme of high order which guarantees unconditional 
stability with low numerical diffusion. This is obtained by | 
upwinding along the physical characteristics defined by: 


f 
dx n +l 
n EA EE e ] 
(2.2) 
nak g i 
x = M, each node of the natural grid. 


The computation of these curves is a simple problem 
reversed in time, solved by applying a Runge Kutta method of 
order 2. In the finite-element approach, the idea of 
upwinding led RONAT (Benqué and Ronat, 1981) to introduce 
the use of test functions, transported along the characteristic 
curves. 


ob uv =o, te [t,t 
(253) 
p(t = t’ i ixj = enS = ġ(x) (classical Galerkin 


shape function) 


{y} then defines two grids, the natural one at time t ao and 
A | f: n 

the upwind grid at time t , constructed at the foot of the 

characteristic curves. It can be observed that this weak 

formulation leads to the projection of the velocity field on 

the space associated with the upwind grid. 


SPLITTING UP TECHNIQUES FOR INDUSTRIAL FLOWS 105 


At each time-step, the treatment of advective terms leads to 
computing a right hand term and then solving the problem as 
shown below in the standard equation (2.5). 


After integrating the weak formulation of equation 2s) in 


n 
space and over time between t and tP + a and after some 


mathematical handling, taking equation (2.3) into account, we 


have: 
ie e $ t n+l an - fe ey Nao 
n+l (2.4) 


+ Í n atf KV£VWd2 = Boundary 
t Q 
term, 
With a Crank-Nicolson discretisation of the diffusion term, 
we have: 
fa et in Kl an + = A xven + lyy” + lag 
(2.5) 


sfe ran + pa XTE VV AN + Boundary term. 


The equation of diffusion and transport then takes the form 
of a simple diffusion equation. The whole advective effect has 
been done away with by the use on‘ the right-hand side of the 


upwind test functions y’. This new formulation splits up the 
operators but is of second order over time. It is possible to 
devise an exact integration rule for a one-dimensional 
computation, and the scheme then shows outstanding properties: 
negligible damping and phase shift for a wave length which is 
greater than five mesh steps. Numerical integration is of 
easier use for two-dimensional computations, but introduces a 
degree of damping. Nevertheless, performance remains very good. 
Many results on classical test cases, such as the rotating 
cone, have been already presented (Benqué et al., 1981, 1984, 


1985). 


3. APPLICATION TO NAVIER-STOKES EQUATIONS 


For Navier-Stokes equations, the treatment of the advective 
term leads to the computation of a right hand side term, as 
mentioned earlier, followed by the solving of a Stokes-type 


problem: 
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T! - vAu + Vp =s /2 (3.1a) 
ve (un * 1) = on (3.1b) 
Sees u,/T (3.1c) 


Algorithms allowing, despite (3.1), an efficient uncoupled 
computation of pressure and of each velocity component were 
sought following the general principles mentioned above. 


The CHORIN-TEMAM projection algorithm, which implies an 
approximate boundary condition for pressure, is one of them. 


It is based on a natural decomposition of L (A) (Chorin, 1968 , 
Temam, 1971) 


1 (9) =H@H+ 


with 2 
H = {ue L (M| V. (w = 0, y 9 = 0} 


where Yn is a trace operator defined on: 


EM = {ulwe 1'(9), V. (w © £7(@)} 


by: Spy, (ww ar = J. u.v wat + fov.(uyw aa 


Hence Y,, (9) is tantamount to u.n for smooth u. 


ut= {yer (M| u = Vp, pe wy (0) } 


To use this decomposition, we transform in (3.1) variable u in 
vi a 


+ 
oe rit EEEo 


with V Py a lifting of ugs? on to 2, the solution of: 
| Ap =o/a 


(3.2) 
= ug-n/T 


v] v 
B( 0 
o 
i) 
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Problem (3.1) becomes: Find (v" Ke oe y p° z f E€ H xH L such 


that: 


7 SUTANE 


The CHORIN-TEMAM algorithm is then the following: 


t + 
For a given estimate of pressure Pa ue 


. Firstly compute v” Han solution of (3.3) 


. Secondly consider the quantity: 


+ + 


At convergence the right-hand side will be the decomposition of 
ai on H and H +. Thus, project t +1 on H- to obtain a 


better estimate of the pressure, and solve 


n+l +1 Pa hita Lie emt n+l 
fa DS Wan = fo fa -war=fave Wan + [Ver vy aa. 


(3.5) 


Vp is a genuine function of H L, when » is a shape function of 
H, (2) , the space where we seek the pressure. 


. Thirdly iterate eventually on pressure. 


An other approach is the UZAWA algorithm. Actually (3.1) is 


+ ki p` + Do For a given 


+I 
and the associated velocity field an 


a saddle point problem on (Gi 


n+1 
pressure by 


hey 
(solution of (3.la) - (3.1c)), the quantity Vun ) is the 


gradient of the functional to be minimised in order to reach- 
the saddle point. UZAWA algorithm is then a gradient algorithm 

_ to solve (3.1); actually we develop a conjugate gradient 
version. In order to accelerate the convergence we have 
searched an efficient preconditioner, a natural idea is then to 
use the previous projection operator: 
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a 


n F 
fe Vs, W an Shaa (un Wa 


=s 1 ¥ Be a (3.6) 


eB Oe 


UZAWA algorithm with preconditioning improves the efficiency 
of CHORIN-TEMAM algorithm by use of optimal direction and 
optimal steepness parameter (Labadie and Lasbleiz, 1983). 


4. DOMAIN DECOMPOSITION 


Jointly to this study of efficient Navier-Stokes solvers for 
3-D problems, have been developed new algorithms of domain 
decomposition. They allow one to divide a computational 
domain into substructures and obtain independent problems on 
each subdomain by the use of a simple symmetrical boundary 
operator. 


The main aim of the geometric splitting is to pull out the 
most available regular part (i.e. topologically equivalent to 
a rectangle) from the initial domain. 


Although industrial cases require locally accurate 
discretisation these regular substructures can represent from 
50% to 90% of the computational domain, the lower bound 
corresponding to confined flows (like the study of thermal- 
hydraulics of fast breeder reactors, cf (Esposito and Hauguel, 
1983) and the upper bound to environmental problems (like 
modelling of the flume of a cooling tower, cf. (Benqué et al., 
1983), (Grégoire and Goussebaile, 1984). 


On these regular parts, we will be able to use specific 
tools like grid discretisation with IJK indexation, multigrids 
methods, ... which allow a great reduction in computational 
cost even in a finite-element discretisation concept. 


Further, because of their high degree of parallelism, these 
methods are well suited for solution by multiprocessor 
computers and let one solve problems involving several 
mathematical models according to the region under consideration 
(a typical example is the matching of a viscous flow and 
inviscid flow for aerodynamic problems). 
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Given the concept of operator splitting mentioned above, the 
main difficulty lies in matching the solution of the Stokes 
problem. Since characteristic curves have been used 
explicitly, advective terms are discarded on the right hand 
side; near an inside junction boundary of a subdomain where 
there is an inflow the advective term is as easy to compute as 
in any other part of the subdomain; indeed, the time-space 
transformation applied to the contribution of the boundary 
term allows its computation to be effected over the 
neighbouring subdomain using the characteristic curves (see 
Cahcuet (1985) for further detail). 


As regards the Stokes problem, adequate boundary conditions 
should be determined along the junction boundaries to uncouple 
the solution operatioun in each subdomain. The analysis of the 
continuous equations shows that the global solution obtained 
on the whole domain is the unique solution which satisfies the 
Stokes problem on each subdomain and ensures the continuity of 
velocity and stresses along the junction: from this main 
result due to MARTINAUD (1984) follows clearly two 
formulations; seek the Dirichlet conditions relating to 
velocity which give equality of the stresses (resulting from 
Stokes equations) on each side of the junction (primal method) 
or vice versa (dual method). These two approaches can be 
formulated in terms of linear boundary operators which are 
symmetric and V-elliptic. 


For the sake of simplicity, here we only consider the 
primal appproach applied on a domain 2 split in two parts a, 
i = 1,2 without overlapping. As shown in Fig. l, 2 and T 


denote respectively the junction boundary and the physical 
boundary of the subdomain R- 


XM 


aT 


Fig. 1 Splitting of domain 2. 
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In order to simplifiy the required expressions we introduce 
the following notation: 


(Vj, pj) {up s usr sit 
i 
which are the velocity and pressure solutions of Stokes 


problems on Q, (cf. equation above) corresponding to u, and u 


boundary conditions on aa, and second member Si on Rs 


and two function spaces in conjunction with the restrictions of 
the velocity vi and the stresses A along 2: 


va {y eH 


YEH o » fyn ar = 0} 


1/2 


where HIS đenotes the interpolate between the classical 
2 
Sobolev spaces ala" and L gai (these conditions taking into 


account the decreasing of velocity near l, ~ I), 


i 
a =H /? ("AR 
where R denotes the equivalence relation: 


1/2 (5) 2N 


V(o 1) © , Ro t-o=Cn, CER. 


We are able now to introduce the primal matching operator K 
mapping V onto T as 


VS CU rS: 
MA 


where oO, is the restriction of the stress along E, calculated 


Senne 


from 


(Yir Py) 10, uy, O} 


H 


Then operator K is an isomorphism from V onto T. Moreover the 
associate bilinear form k(.,.) defined by: 


k: VXV +R 


e > Kun y ese N 
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(where <.,.> denotes the duality pairing between V and T) is 
continuous symmetric and V elliptic (see Martinaud, (1984) for 


proof). Let us now introduce the stress oy obtained from 


vi and Pj) Tupis O, Sj}: it is obvious considering the 
linearity of the Stokes equations that the sought boundary 
condition is the unique solution of: 


K(u,) Ci eh cay g? 


because of the continuity of the stresses along 2. 


Thus the numerical resolution of time dependent Navier- 
Stokes equations requires at each time step four stages: 


A - Calculation of Sy, the right hand side resulting from 
advection; 


B - Calculation of the stresses jump along the junction by 
solution of a Stokes problem on each subdomain; 


C - Inversion of the discretized matching operator (which is a 
symmetric positive definite matrix whose rank is equal to 
the number of velocity degrees of freedom on I) to obtain 
33 

D - Calculation of the final velocity and pressure fields on 
each subdomain with the previously computed boundary 
condition. 


Note: Steps A, B and D which represent more than 95% of CPU 
cost, can be carried out in parallel on multi-processor 
computers; only step C requires a task synchronization. 


As regard to the good properties of the matching matrix K, 
it is advisable to use an iterative solver like conjugate 
gradient with scaling, which avoids assembling, factorization 
and storage of a full matrix. 


Many numerical tests of domain decomposition applied to the 
Navier-Stokes and shallow water equations (cf. Cahcuet, (1985)) 
have proved the strength and efficiency of these methods. The 
first example in figure 2 compares to a reference solution 
(global computation of the << driven cavity >> flow) the 
velocity and pressure fields obtained with a 3 subdomain 
decomposition using a Taylor-Hood finite element discretization. 
Then in figure 3 is considered the Stokes flow around a 
cylinder, a configuration which allows an important regular 
subdomain (75% of the whole domain). 


112 BENQUE ET AL. 


These tests and many others show that domain matching 
remains costly if the regularity of the subdomains is not 
exploited about five time more in CPU cost for half memory 
used. This ratio decreases when the size of the mesh grows 
(due to the growth of the time consumed for a linear system 
solution) and these methods become fully competitive for 
industrial cases, if the use of regularity can provide a 
reduction by a factor ten of CPU cost on regular subdomains. 


Global computation Computation by subdomains 


NAVYS SS e- EESE 
Rá Pda ma a A AA toe aad 
ENON 


Velocity 


Pressure 


Fig. 2 Solving the Stokes problem with domain decomposition: 
the wall driven cavity test case 
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5. TREATMENT OF TURBULENCE MODELS 
a) Main difficulties and adopted solutions 


The two main difficulties due to turbulence modelling for a 
3 D finite element code are: 


> 
+ firstly the recalculation of the matrix due to the change of 
the viscosity field at each time step, 


+ secondly the use of a wall function in order to avoid the 
need for an excessively thin grid along the wall due to the 
boundary layer. 


The first difficulty is simply solved by use of semi explicit 
treatment of the diffusive terms: every p time steps the matrix 
is assembled; between each reassembly the evolution of 
diffusive terms is taken into account by an explicit term on the 
right-hand side of the system to be solved: 

n n 

n o n o 

ve âu = ve A Lik Me ae yA u. (5.1) 
implicit explicit 
treatment treatment 


The second difficulty is related to the velocity boundary 
conditions along the wall. These conditions express: 
Firstly, impermeability: 
un = 0. (5.2a) 


Secondly, a linear relation between the shear stress and the 
tangential velocity: 


aea a a: A P = BS (5.2b) 


In (5.2) n and T are respectively the normal and the tangent 
to the wall, Ve is the turbulent viscosity, a and ß are local 


parameters which depend on the turbulence model. These 

boundary conditions naturally couple the velocity components in 
general; to avoid this coupling we Propose another formulation 
where Dirichlet conditions q on the wall T are sought in order 


to verify (5.2). 


In a similar way as for domain decomposition, this 
formulation leads to the minimization of a quadratic functional 
on q along the boundary Ta (cf. Goussebaile et al., (1984)). A 
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conjugate gradient algorithm, of the following form, is then 
justified: 


For a given estimate of q, qa along the boundary re 


. Firstly compute by UZAWA algorithm the associated velocity 
and pressure fields solutions of (3.la) and (3.1b), 


. Secondly compute the boundary error in the profile along uss 
using (5.2) and deduce a new estimate e CAPETE of q 


. Thirdly iterate until convergence on (5.2b). 


In this form the algorithm is still expensive so we have 
empirically developed an alternative which consists in going 
only once through the UZAWA algorithm for pressure at each 
iteration on q. This alternative form is then the following: 


ulda P,,) > Pete. > ug, abi Pa) 


+ divergence error > u( TE] ) 
Sas OG I’ Pme 


The algorithm stays very close to the original UZAWA 
algorithm and saves much CPU time, its efficiency seems to be 
due to the relative independence between pressure and 
tangential component of the normal stress. 


Before introducing any refined eddy viscosity model, our 
purpose was to study the behaviour of this algorithm. In the 
test case, presented in figure 5, a simple mixing length 
turbulence model and boundary conditions such as (5.2) on the 
walls were used. The computation of the eddy viscosity through 
more complex models (k - €) does not introduce any new 
difficulty compared to the first approach. Test cases concern 
inflow through tubes cluster (fig. 5) and two cases have been 
treated: smooth wall (pure slip conditions (a = O, 8 = O 
along tubes in (5.2) and rough wall (a = 20, B = O along tubes 
in (5.2)). The influence of æ is very important for the 
structure of flow; a rough wall implies development of large 
vortices between tubes; the influence of turbulence modelling 


is very limited due to the small Reynolds number (107) chosen 
in this test. We notice in figure 4 the quality of convergence 
of this algorithm concerning divergence error and velocity 
boundary error whatever the value of 4. 
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Fig. 4 Convergence of the preconditioned UZAWA algorithm 
associated with the treatment of the wall function 
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Fig. 5 Flow Through a Tubes Cluster 
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b) Turbulence modelling. The standard k - € model 

The turbulence model used in the computations is the k - € 
eddy viscosity model, in which the eddy viscosity Ve and the 
eddy diffusivity Kr are related to the turbulent kinetic 
energy per unit of mass, k, and its dissipation rate, €, by: 


pd $ SIPE 27. 3 2 
vy, = cul vI[n] = C vk. Vk" /e = Ck /ei Ky ok Je (5.3) 


The transport-diffusion equations for computation of k and e€ 
are written as follows: 


ake Salve =v 2 Wk) ESO le (5.4a) 
ot ~ Oy 
de ne vt e ; rs © 
seit u.Ve = Viet Ve) + k (Cy IP + Coy (1 C.3)G Co 98) 
(5.4b) 
ðu ðu ðu, 
Å plat eae: Baie Se fi if 2T 
with IP = vt ix, x, + Ox, ) and G = - Bg Kr 3z 


IP and G are source terms due to the shear stress (P) and 
thermal effects (G). 


Following LAUNDER and SPALDING (1974), the constants are: 


Cu=0.09 CH p=0.- 09 o,=1 o 51-3 Ce,=1.44 Co =1. 92 


k $ 2 


The choice of the constant CE3 is discussed further. 


Following Rodi (1980), the equation for k near walls reduce< 
a local equilibrium between production iP and dissipation €; we 


deduce from the wall shear stress expression ut boundary 
conditions on k and £ along the walls: 


Hence k= ur? / ycu (5.5) 


From the logarithmic condition on the velocity profile near the 
walls: 


E) (5.6) 
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with d: distance to the wall; K: Karman constant, E: roughness 
parameter we deduce << boundary condition >> on € << near >> 
walls: 


e= u*?/ (K.a) (5:7) 


Boundary conditions on velocity along the walls are the 
following: 


u . n = 0 
ðu (5.8) 
i A a u* [u+] 


The variables k and € are treated quadratically. Actually a 
standard explicit treatment of the source terms in the k and € 
equations being unstable, these terms are upstreamed by 
integration along the characteristics during the advective 
step. 


Finally in the case of k - e turbulence modelling the 
algorithm becomes the following: 


At each time step t’: 


Firstly deduce the eddy viscosity Ve and the eddy diffusivity 
Oe by projecting EEr during this step YE and Ga are treated 


up to now as quadratic unknowns at the left hand side and k and 
€ are computed explicitly at the right hand side. 


Secondly compute u* along the walls by solving iteratively (5.6) 
at each wall node using the explicit values of the velocity. 


Thirdly compute the advection of all the variables; during this 
step the source terms of the k and € equations are integrated 
along the characteristics. 


Fourth compute the diffusion of all the variables other than 
velocity; at this point the temperature, the kinetic energy 


and the dissipation at time t are known. 


Fifth solve the Stokes problem; this is accomplished by the 
previous algorithm which allows the decoupling of each velocity 
component despite the wall conditions. 


A first illustration of this algorithm is given by the test 
case of the 2 D'BOYLE and GOLAY' CELL; this test case was 
submitted at the last AIRH working group on refined flow 
mođelling stated in Cadarache - France, January, 1985. The set 
of boundary conditions given on figure 6 were requested. 
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In figure 7, a comparison between k profiles, at different 
levels of the cell, obtained by several authors have been done 
by M. GRANDOTTO and C. BOUFFINIER, organizers of the Meeting. 
We have added our own results; they are in good accordance with 
the global set. The peaks in k occur in the regions where the 
velocity gradients are steepest, since k is produced by mean 
flow shear. But as observed by BOYLE and GOLAY (1983), the 
calculated k distributions differ significantly from the 
experimental data on the upflow side of the vortex; the 
magnitude is overestimated and decreases with the elevation of 
the jet which is contrary to the experiments. Near the outer 
wall we observe a decay of k along the downflow streamline in 
the same fashion as for experiments. This decay is due to the 
fact that in this region the dissipation rate of k has risen to 
the point where net production balance is negative. This test 
case points out some well known defect of the k - € model in 
jet flow; on the other hand it allows a validation of our 
numerical treatment for this model. 


6. INDUSTRIAL COMPUTATIONS 


According to the earlier splitting up techniques, a three 
dimensional Navier-Stokes finite elements code N 3 S (Navier- 
Stokes, 3 D, Splitting) is under development at E.D.F. 
(Gregoire and Goussebaile, 1985). Up to now this code allowed 
the treatment of laminar Navier-Stokes equations associated 
with the advection-diffusion of a passive scalar. Recently a 
simple mixing length turbulence model has been implemented. 


In the near future our aim is to treat an active scalar 
(Boussinesq approximation) and besidesk - € turbulence model 
associated with logarithmic boundary conditions. N 3 S will 
then be an efficient numerical tool for simulating industrial 
flows. 


Up to now the main result concerns a laminar flow behind a 
cooling tower, associated with the advection-diffusion of a 
pollutant at the outlet of the tower. The main characteristics 
of this computation are summarized in Fig. 8; its main 
interest is the strongly three dimensional character of the 
flow in a relatively simple geometry. 
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Fig. 8 Jet in a Crossing Wind 


Mesh: 2844 tetrahedrons, 4653 nodes 
Boundary conditions: 


crossing wind velocity at the inlet of the domain : 36v/D 
64v/D 


velocity at the outlet of the cooling tower 
purely slip conditions at the top of the domain 
no slip conditions at the bottom of the domain 


. free outlet of the domain 


Computer: Cray l 
CPU time for 1 time step: 
Advection: 2.0 s Stokes problem: 11.3 s 


Core requirements for matrix 


Uzawa solver-packing mode: 0.4 million words. 


From a physical point of view despite a rough mesh the 
computation reproduces quite well the shearing of the jet, due 
to the crossing wind. One can notice on Fig. 9 the ascent of 
the central part of the jet, while at the same time, the 
peripherical part is deflected by the wind; the computation 
reproduces the typical formation of rolls due to the shearing 
and to the deflection. Furthermore one can notice the 
separation of the flow and the recirculation behind the tower 
(figures 9, 10). The evolution of the pollutant essentially 
produced by the advection shows on figure 11 the evolution in 
<< horseshoe >> of the jet. All these reults concerning a 
complex three dimensional flow are encouraging for the near 
future capabilities of N 3 S. 
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Fig. 9 and 10. 
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Fig. 11 Cooling Tower in a crossing wind:Advection - Diffusion 
of a pollutant 


From the numerical point of view, the core requirements on 
Cray 1 is 0.4 million words for Uzawa algorithm using packing 
mode; the inner solver for linear systems is then a conjugate 
gradient preconditioned by an incomplete Cholesky factorization. 
This value is to be compared with the 17 million words needed 
by the global solver previously used on the same example: this 
solver used the Cholesky factorization of the global Stokes 
problem discretization and the resolution was made by blocks. 
Despite achieving a better numerical precision, all other things 
being equal, by use of a direct solver, the Uzawa algorithm with 
packing mode will be preferred: the loss of accuracy results 
from the divergence, for which great precision in 
thermohydraulic problems is not required; the error 
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divergence has naturally a great influence on pressure, but a 
moderate one, if reasonable, on velocity which is the quantity 
of major interest. Anyway the Uzawa algorithm, less precise 
than the direct solver, nevertheless improves the quality of 
divergence in comparison with solvers previously used such as 
Chorin-Temam algorithm as we have seen before. On the other 
hand, the Uzawa algorithm allows a drastic reduction of core 
requirements for industrial computations. Up to now the CPU 
time for 1 time step in the computation is 2 seconds for 
advection and 11.3 s for Stokes problem (4 iterations of 
Uzawa algorithm with packing mode); in the near future a Cray 
XMP will take the place of Cray 1 and by use of a cabled 
gather-scatter the CPU time for Stokes problem will decrease 
down to 6 s: a similar time to that of 3 D finite difference 
codes intensively used today at LNH with the Uzawa algorithm. 
These performances are very encouraging for the future 
capability of N 3 S. 


7. CONCLUSIONS 


As we stated earlier, the numerical simulation of industrial 
flows requires the consideration of a large number of variables 
in complex geometric patterns. The use of a finite element 
discretization allows a natural treatment of complex geometry 
with local refinements; the use of fast transparent splitting 
algorithms allows realistic simulations of industrial flows; 
the use of domain decomposition could open the way to an 
efficient use of parallel computers taking advantage of the 
regularity. 


Finally all these splitting up techniques allow the 
numerical simulation of industrial flows. It is the conviction 
of the authors that they will be in many respects particularly 
useful for fully 3 D complex flow simulations of high quality. 
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SPECTRAL METHODS FOR DISCONTINUOUS PROBLEMS 


S. Abarbanel, D. Gottlieb and E. Tadmor 
(Tel-Aviv University, Tel-Aviv, Israel 
and 
Institute for Computer Applications in 
Science and Engineering, USA) 


l. INTRODUCTION 


We show that spectral methods yield high-order accuracy 
even when applied to problems with discontinuities, though 
not in the sense of pointwise accuracy. Two different 
procedures are presented which recover pointwise accurate 
approximations from the spectral calculations. 


Consider the evolution partial differential equation 


ue = Lu, on a finite interval, where L is a hyperbolic 


operator. The solution u has a projection Py u on a finite 


subspace (which may for example consist of the first N modes 
in a Galerkin method, or N collocating points in the interval), 
and a numerical approximation wy generated by some spectral 


method. For linear operators it is known from the Lax 
equivalence theorem that if the scheme is consistent and 
stable, then Uy approximates Py u in some appropriate norm. 


If u is smooth, then the theorem implies that uy approximates 


the solution u in the same sense. 


In practice, one looks at the point values of Uy at the 


grid points and takes it as an approximation to the values of 
the true solution u at these points. We shall call this 
approach the realization of the computed solution via its 
grid-points value. The aims of the paper are: 1) demonstrate 
that when u is a complicated function, this realization will 
not produce acceptable results; 2) to suggest different ways 
for the realization of the solution in such cases. 
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The following examples give a very clear illustration of 
the misleading results that may be obtained by pointwise 
realization. 


Example 1 
Consider the equation 


u =u Osx 27 
T x 


(1.1) 
u(x,0) = Uy (x) 


where u(x) and uy (x) are periodic functions and U, (x) isa 


discontinuous function. If we expand Uy (x) in a Fourier series 
we get 


Uy (x) = i ESA (1.2a) 
where 
2. = Ł i u oe i, (1.2b) 
The solution u(x) is thus given by 
œ 
u(x) = 2È ak ane eat 


k=-0 


Suppose that (1.1) is solved numerically by the Fourier-Galerkin 
method, namely we seek a trigonometric polynomial of the form 


N 
wt) = E b (te 
k=-N 
that satisfies 
a 3 
Se ae i =O, NsksN 
(1.3) 
N : 
By (XO) = Ea ee, 


k=-N 
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From (1.3) it is clear that 


db, (t) 
-ae = ‘ik b(t), - Nekeun (1.4) 


and 


b (0) = a 
yielding the solution 
ikt 
b (t) idle E 


Therefore 


N 

ikt ik. 

u(x,t) = E alee. (1.5) 
k=-N 

Equation (1.5) implies that uy (xr t) , obtained from the 
numerical solution (1.3), coincides with Py u(x,t), the 
Galerkin projection of u, thus yielding the best possible 
convergence of uy to Pue: However, since the Fourier series 
of u(x,t) converges very slowly, the point values uy (xt) 
will not approximate well lacie) In general, one would 
witness the Gibbs phenomenon of overshoot in the neighbourhood 
of the discontinuity and global oscillations all over the 
domain. In fact, even the initial approximation, wy (x10), 


displays the same behaviour in relation to Uy (x) 5 


In the second example we show that the same phenomenon 
occurs even if the numerical initial point values do 
approximate the true initial point values to a high degree 
of accuracy. 


Example 2 
Consider the equation (1.1) where Uy (x) is the saw-tooth 
function ax MES 
u (2%) = (1.6) 


A(2x - 7) x> x 
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i l 
for some k, O < k < 2N-1, x S (kt >). 


In the pseudospectral Fourier method we seek a 
trigonometric polynomial Vy (Xt) 


N 
vy(xrt) = E b, (tje t* (1.7) 
L=-N 
such that 
ov. Ov, 
ga a agi = E 
FE ox at the points = RGT j = O,...,2N-1 (1.8a) 
WO = U, (x, x) (1. 8b) 


Since Vy is a polynomial of degree N, (1.8a) implies that 


ðv, 


avy MSS 
ax 


at 


(1.9) 


for all x€ (0,21). Moreover, from (1.8b) it is clear that 
Vy (X70) is the (unique) trigonometric polynomial of order N 


that interpolates Uy (x) at the points xjr j = O,...,2N-l; thus 


vW% = E a, (ei * = AF (x,x) (1.10) 
2=-N N 
where 
F 1 2- E 
ap (5) m E u (x,:x)e à (1.11) 
2 2N 
cy j=0 Ones 


Performing (1.11) we get 


a (ie) = a [Ik - n+ .5] (1.12) 
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= (+1) 
=o S T l-e p LEA 
a, (x) “one, ? Zin +ictn--1,2#0 
N 
l-e (1.13) 
where 
Cig = GisiZy N |x| #N. 


The numerical solution Vy (Xr t) Of (ITS) y LO) is 


vO = Vy (* + t,0) = AF (x + t,x) (1.14) 
and upon manipulating (1.12), (1.13) one gets 
Vy (xt) = AF, (x,x - t) + At. (1.15) 


The trigonometric interpolant By (7%) collocates u (1X) at 
the grid points ssn However, in between the grid points it 
oscillates. If we read the values of vt at the grid 


points, then by (1.14) 
Wee = AF, (*, + t,x) 


and unless t = = for some integer m, we will get a solution 


that looks oscillatory. Thus, even though the initial 
appromixation looks smooth at the grid points, when it evolves 
in time the oscillations will present themselves at the points 


Xe 


The conclusion one might draw from the above examples is 
that spectral methods (or any higher-order methods) are useless 
when applied to discontinuous functions. A different approach 
is to look at a different realization of the numerical solution 
rather than the pointwise one. We will argue that high-order 
accurate information is contained in the numerical solution 
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and demonstrate how that information can be extracted in such 
a way that accurate pointwise approximation to the true 
solution can be obtained. 

2. INFORMATION AND HOW TO EXTRACT IT 


Consider the linear equation 


u = Lu 
E (2.1) 
u(0) = YU 


where L is a linear hyperbolic operator with variable 
coefficients and uy is a discontinuous function. For 


simplicity, we will restrict ourselves to a periodic one 
(space) dimensional problem, though the results are more 
general, (see Gottlieb and Tadmor (1985)). Let v be the 
solution of the auxiliary problem 


(2.2) 
v(0) = v 


where Vo is a ai function. Because of the hyperbolicity of 
L, (2.2) is a well-posed problem. In Lemma 1 we quote the 
well-known Green's identity. 


Lemma l: Let u(t) and v(t) be the solutions of (2.1) and 
(2.2) at some level t, then 


(u(t), v(t)) = (U5 1VQ) (2.3) 


Assume now that (2.1) and (2.2) are discretized by the 
Fourier-Galerkin method. That is, we seek uy and Yy that are 


ae polynomials of degree N such that for every 
kr |k] ¢ N 


a 
ik 
ae e | =0 (2.4a) 
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[ay 109 -uy i] =0 (2. 4b) 
ov. 
ikx N, * uy 
[e ' (GE +L v] =0 (2. 4c) 
ikx 
e, [wo - vo) =0. (2.44) 


We have also a Green's identity for uy and K 


Lemma 2: 


[KEE] = (Ovo). (2.5) 


Proof: Since Vy (t) and uy (t) are no order trigonometric 


polynomials we use (2.4a) and (2.4c) to get 


a 
(a= my WTO 


and therefore 
e (wa) = Ew) > (w w) =o 


which implies (2.5). 


We will proceed by showing the relation of the RHS of (2.5) 
to that of (2.3). 


Lemma 3: 


ESO) ©) = [too] te (2.6) 
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whe re 
ly Il 
Ee des (2.7) 
N 
for every s. 
Proof: From (2.4c) it is clear that 
- = 0. 2.8 
(40 Yo! wO) o (2.8) 
Also, 
yg Hull i - vil 
IAG vo] x Klug Vy (0) Yo 
o 
and since Yo isa C function, 
ly l 
lv (© - vo! Š% K S for every s. (2.9) 
N 


Now 
[K O. ¥y'0)] = (ugrv) + [O E A ¥y()} R [oO - vo) 
and in view of (2.8) and (2.9) 


(20), vyo) = uro) +e) 


where 
Ny ll 
Os 


e€ ERK 
le] aame” 
and this proves the Lemma. 

From Lemmas 1 - 3 we can conclude: 


Theorem l: Let u(t) and v(t) be the solutions of (2.1) and 
(2.2), respectively. Let wy fe) and Vy (4) be the solutions of 


the Fourier-Galerkin approximations of (2.1) and (2.2). Then 
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Iy ll 

Os 

TEG for every s. 

N (2.10) 


$K 


| (Me y‘) = face» v(t) ] 


The proof is an immediate consequence of (2.3), (2.5) and (2.6). 

Assume now that the Fourier-Galerkin method described in 
(2.4c) and (2.4d) is stable; then vy (t) approximates v(t) 
within spectral accuracy, that is 


» dvi 
s 


Iv (t) - v(t)l = E $K 


nevi 


We can, therefore, replace v(t in (2.10) and get 


(ay (2) vied] = (ce) vied} +e 


where € is spectrally small. We use now the fact that every 

C” function v(t) can be obtained from some Y in (2.2). This 
is, in fact, one of the definitions of hyperbolicity. We can, 
therefore, state: 

Theorem 2: Let u(t) be the (nonsmooth) solution of (2.1) and 
let wy (t) be the solution of the spectral Galerkin approximation 
to (2.1). Then for any c function v(t) 


(ay(t) vied} = (ater vie) + e (2.11) 


where € is spectrally small. 


Thus, uy (t) approximates weakly u(t) within spectral 
accuracy. It is in this sense that uy (t) contains highly 


accurate information about u(t). We will show later how to 
use this information in order to obtain spectral accurate 
approximation to the grid-point values of u(t). 


We turn now to the pseudospectral Fourier case. Here we 
need some preprocessing of the initial data in order to prove 
the same result as in Theorem 2. 


Theorem 3: Let yy (ert) be a trigonometric polynomial of order 
N that satisfies 
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mæ 7 at x = ay nee mine r j = 0,..<62N-1 
(2.12) 


(ay -u,e =0, |x| <N, 


ikx 
AN 


(ie., uy (x,t) is the solution of the pseudospectral Fourier 
scheme, but initially uy (x70) is obtained by the Galerkin 


projection) . 


Then for every smooth function u(x,t) 


i 2N-1 20 
5 2 Wi lxy7t) vixyet) = [ u(x,t) v(x,t) dx +e (2.13) 
N j= N j j o 


where € is spectrally small, provided that the pseudospectral 
approximation is stable. 


Proof: Let VN be the solution of the pseudospectral Fourier 
approximation of (2.4a) and let vy (0) be the Galerkin 


projection of Yor that is 


=o, |k| SN: (2.14) 


From (2.12) and the analog equation for Vyr one gets 


* 2N-1 i N 
me (x,,t) vi (x,,t) == £ (x, ,0)v.(x,,0). (2.15) 
NG, Uy 5 NG By 5197 May 5 


From the exactness of the trapezoidal rule for polynomials of 
degree 2N, we conclude 


la 


2N-1 {" 


Z lxt) glt) = | uy (0) vyl% oax = (4y (0) +%(0)] 


ion 


o 
(2.16) 
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Note that the initial functions Vy (X70) and uy (x70) are not 


the interpolants of YU and Yo as in the usual pseudospectral 


methods but rather the Galerkin approximation to to and Yo: 


We recall now Lemma 3 and equality (2.3) to establish (2.13). 
The proof is thus completed. 


It is interesting to note the way in which the information 
is contained. The interpolant of YU looks smooth at the grid 
points, whereas the Galerkin approximation of YU looks 


oscillatory on the grid points. It means that in order to 
preserve the information one has to require initially 
oscillatory-looking solution. The information is preserved 
in the structure of the oscillations. 


We will show now a way of using (2.11) and (2.13) in order 
to construct a better approximation to Bee) than the one 


given by uy (x5 /t) . (Here Uy (x,t) is given by either the 
Galerkin method or the pseudospectral method). 
From (2.13) and (2.11) it is clear that in order to get 


a good approximation to u(y,t) at some point y € (0,27), 
we need to find a function Vy (x,t) such that 


2T 
f u(x,t) v (x,t)dx = u(y,t) + Eyr 
o X: 


where € is spectrally small. By (2.11) we will have 
27 
Ik wy (x,t) Vy (ert) dx = uly,t) te +E (2.17) 


for the Galerkin method and 


2N-1 
g Uy A YEIN) =u(y,t) +€ +€ (2.18) 


T 
N 
for the pseudospectral method. 


For convenience we shift the interval [0,21] to [-1,7]. 


Let p(x) be a c -function vanishing outside the interval 
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(-1,7] satisfying 
e(O) = 1. (2.19) 


Let pasa) be the Dirichlet kernel, namely 


THE ikx _1 sin (p + yy. (2.20) 
Dp (y) URT e uy 2m sin (y/2) 
We set now 
v(x) = y%Pix) = et p(8'y)D_(y). (2.21) 


One can prove as in (Gottlieb et al., 1983) that 


(2.22) 


H 8 
f u(x) y "P (y-x)āx = u(y) + £, 
-T 


where €, is spectrally small. 


Thus, it is possible to extract accurate pointwise values 
from u% . 


3. NUMERICAL RESULTS 


In this section we demonstrate the efficacy of the smoothing 
procedure outlined above. As a test function we have chosen 


the piecewise c -function 


sin 


NX 


f(x) = (3.1) 
-sin 2 T'EREN 
Denote its spectral approximation by £,(x), and let £y (x) be 


the pseudospectral approximation to f(x). „It is evident from 
the first column of Tables I and III that £,(y,,) 7 the spectral 


approximation sampled at sA = vn/N - do not approximate f Yy) 
within spectral accuracy. In fact, the error committed by 
fiag'¥,) is only half of that committed by fog Yy) 3 
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Regarding the pseudospectral approximation, E(x) mits of 


course, collocates the exact values at the sampling grid points, 
Eyl) = f(y ); yet, in between these gridpoints, 


ty i +3 5 


(v + wa) approximate f Y Fa) within first-order 
accuracy only, as shown in the first column of Tables II and Iv. 


In order to construct our regularization kernel in (2.22) 7 


we define the cut-off function p(¢) = O(c) to be 


ex eE lel <2 


, (3.2) 


o otherwise 


œ 
namely, Pa is a C - function whose support is the interval 


Izl < l. Ņ to be used in (2.21) is of the form 


PP =o (ot y) Sa et ayes (3-3) 


279 sin y/20 


The post-processing procedure of the spectral approximation 


zt involves convoluting Ee against yer, namely 


sin (x-y) /20 


2T 3 
Fe Af vr PX] sin (p + »)y/0_ 4, (G32) 


where x is a fixed point of interest. (In practice, we use 
the trapezoidal rule to evaluate the right-hand-side of (3.4) 
taking a large number of quadrature points). 


The parameter 0 was chosen as 
6 =n-|x - r|; (3.5) 


this guarantees that y is so localized that it does not 
interact with regions of discontinuity. 


It should be noted; in this stage, that if 0 was so chosen 
to be the same for each x, (and not as in (3.5), the formula 


142 ABARBANEL, GOTTLIEB AND TADMOR 


(3.4) admits a simpler form; that is, if 


Pys £ A ety (3.6) 
ka-0 
then 
N ikx 
EUSTE £(k)o, e (3.7) 
k=-N 


This procedure can be carried out efficiently in the Fourier 
space. 


Next, we turn to the post-processing for the pseudospectral 
approximation £y (x) which is simpler than (3.4). In fact, 


in this case 


om A-L i 
£(x) ~ <2 5 Ey py ‘Pix-y,). (3.8) 
v=0 


Note that carrying out the smoothing procedure defined in 
(3.8) does not involve any extra evaluation of f(y) in points 
other than y , in contrast to spectral smoothing procedures in 
(3.4). As before, the Parameter @ was chosen according to 
(3.5). We have yet to determine the Parameters p and a. The 


Parameter p must be equal to nÊ for O < B < 1, in order to 
assure infinite accuracy. (In our computations, B ~ .8.) 
Finally, we feel that a is problem dependent and we chose 
a = 10. We have not tuned the parameters to get optimal 
results; further tuning may improve the quality of our 
filtering procedure. 


In Tables I, II, III, and IV we give the results of the 
smoothing procedure at several points in the domain. The 
pointwise values are now recovered with high accuracy. The 
first column in each table indicates the points in which the 
Procedure was performed. We limited ourselves to four points 
in the interval (0,1) because of the symmetry of the function | 
f(x). 


$ The second column gives either the spectral approximation 
E09 or the pseudospectral approximation Ey , N = 128 in 
D 


Tables I and II and N = 64 in Tables III and IV. -The third 
colum gives the smoothed results, when filtered by (3.4) 
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on (3.8), at the same points as in column I. 


The results indicate the dramatic improvement obtained 
by the smoothing procedure. Moreover, note that the error 
committed by £108 (or £159) is better than ‘the one committed by 


toa (or #2) only by a factor of 2 whereas after the 


post-processing the error improves by a factor of 104. 


Table I. Results of smoothing of the spectral 
approximation of f(x), N = 128 


aaas 


x 5 lE - i) FERE] 
v equals at x- x) 
tt AED MINN eee O 

2 3.2(-3) 5.8(-10) 

3 5.2(-3) 7.9(-10) 

4 7.8(-3) 6.3(-10) 

5 Ipal(=2) 1.1(-10) 


Table II. Same as Table I for the pseudospectral 
approximation £y (0) e 


it 3 
sra a EE a E a ae a 
v equals Lhe 28 EAE 

2 5(-3) 7(-10) 

3 8.1(-3) 7.9(-10) 

4 1.2(-2) 6.4(-10) 

5 1.8(-2) 1.2(-10) 
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Table III. Results of smoothing of the spectral 
approximation of f(x), N = 64 


=E lex) - Ex) | £- f+ | 
v equals 
2 6.4(-3) 4.8(-6) 
3 1(-2) 5.9(-6) 
4 1.5(-2) 7.7(-6) 
5 2.3(-2) 8.9(=6) 


Table IV. Same as Table III for the pseudospectral 
approximation, f(x) 


te, = 5 (vth ) JE ) rete ad aes a | 
v equals at x = Fiti 
2 1(-2) 4.1(-6) 
3 1,6(-2) 6(-6) 
4 2.4(-2) 7.8(-6) 
5 3.6(-2) 8.9(-6) 


4. A DIFFERENT METHOD FOR EXTRACTING INFORMATION 


In this section we would like to present a different 
approach for extracting the information from an oscillatory 
solution. The idea is to subtract from the solution those 
oscillations that correspond to the saw-tooth function 
discussed in Example 2. This leads to the following 
procedure: 
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N 
a ik 
Let uy (x,t) = 2 ue * be the solution of the 
2=-N 
pseudospectral approximation to a hyperbolic problem. We 
try to find an unknown smooth function and a (oscillatory) 
saw-tooth function Fy (x-t, x.) with an unknown jump 27A at an 


unknown location Xs such that 


2N-1 P an a 
H = E Uy (5 t) - AF y (X5 1%.) -c- E by e 3 (4.1) 
j=0 2=-p 
270 


is minimized. Note that we have 2p + 3 unknowns in (4.1): 
A, X c and 2P values of by (2 #0). 


The conditions for local minima of H are found from 
the following 2p + 3 equations: 


2N-1 P 
ik 
A i 8 far, om en ey E E Si) = 0 
3A F j j ape 
jo P (4.2) 
2#0 


where 23 = Buy (065 0) , uy = Uy (65 t) . Also, 


2N-1 p ilx 
OHS ore a a a Deeb. e Sao \(4.3) 
ðc rp 2 
j=0 P 
240 
9H 2N-1 P ix 2 
“=o ==> 4 -aF’ F, -cF7 - FL E be =0 
əs ja Sa E A j j dep £ 
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N da, (s) es 


where Fy = oF, xjr x 05 = pae 3s -e ; and 


E o e b i aa aah a uses” ias) 
m m m 


ab 
n 
where 
a 2 NR Ee 2, 
m Ca j=o 


Substituting (4.5) into (4.2), (4.3) anā (4.4) we get, 
respectively: 


where a“(s) = da, (x) /as. Next, we combine (4.7) and (4.8) to 


get a single nonlinear equation for s: 
uy Zey ay ay - te, ay Uy Be, ae, a, = 0 (4.9) 


where all sums run over p < |2| <N. 


Equation (4,9) is solved iteratively for s. Having 
found s, one immediately obtains from Example 2 all a, (s)“s. 


Then from (4.5) we have the by Sr and A from (4.7). Finally, 
having A we find c from (4.6). 
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The minimum thus obtained may be a local one while we are 
seeking a global minimum. This means that in practice one 
searches for the global minimun. 


We now give an example that illustrates the efficacy of 
the procedure. We solve the following problem: 


d ð 
siete: Oo<x<2m,t>O0O (4.10) 
x 
= EAE 
sin 2 
uy (x,0) = (4.11) 
-sin = ™< x § 20 
2 
uy (O,t)= uy (2m,t)- (4.12) 


We ran the problem on several grids and exhibit here the 
numerical results for the case N = 8 (ie., 16 subintervals 
in the domain (0,2m)). ‘The unadulterated results at t = n/2N 
on the grid points are shown in Figure l. 

Exact solution 


x Unsmoothed pseudo- 
spectral solution 


o Smoothed solution 
(N=8) 


x 


x Fig. 1 
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Table v 
j exact error 1 = error 2 = error 1 
Solution |exact-unsmoothed| |exact- smoothed error 2 
2 -5 5 
O 9.80 x 10 5.86 x 10 5.86 x 10 1.00 
Eo =2 -5 
1 9.80 x lo 1.24 x 10 5.86 x 10 211 
Si 2 -5 
2 2.90 x lo 2.57 x 10 6.30 x 10 408 
SAITA ona A k nore 7.330? 563 
eat 33 -5 
4 6.34 x 10 6.15 x 10 9.30 x 10 661 
s zi 2 -4 
5 7.73 x 10 9.11 x lo 1.31 x lo 695 
6 882x101 1.43 x 1072 2.16 x 1074 662 
=a al -4 
7 9.57 x lo 2.70 x 10 4.42 x lo 611 
8 -9.95 x 107} 1.00 x 10° 1.10 x 10`? 91 
-1 al 23 
9 -9.95 x 10 2.68 x 10 1.34 x 10 200 
-1 E -4 
10 -9.57 x 10 1.42 x 10 4.42 x 10 321 
=I. =2 -4 
11 -8.82 x 10 9.07 x 10 2.16 x 10 420 
e =2 -4 
12 -7.73 x 10 6.12 x 10 1.32 x lo 464 
=I 3} -5 
13 -6.34 x 10 4.11 x lo 9.30 x 10 442 
14 -4.71 x 10+ 2.55 x 1072 7.32 x 105 348 
= =o -5 
15 -2.90 x 10 1.22 x 10 6.30 x 10 194 


We then post-processed these uy (4, 77/20) values according 


to the procedure described above. The filtered values are 
shown on the same graph, and the errors listed in Table V are 
computed before and after processing. The dramatic improvement 
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is evident. 


Next we demonstrate the procedure in the case of the Euler 
equation for gas dynamics. Because the physical problem 
involves inflow, outflow, and no-flow boundary conditions, 
periodicity could not be imposed and we use the Tchebyshev, 
rather than Fourier, pseudospectral method. 


The physical problem is that of a wedge, inserted at a 
zero angle of attack, into a uniform supersonic flow of an 
ideal gas with y = 1.4. An oblique shock develops in time 
and the flow reaches, after a while, a steady state. The 
time-dependent Euler equations in two-space dimensions were 
discretized by the pseudospectral Tchebyshev method in space 
with an 8x8 grid and a modified Euler scheme was used for the 
time discretization. Since we are interested in the steady 
state only, the accuracy for the time integration is of little 
importance. In order to be sure that a steady state is reached, 
the code was run until all physical quantities did not change 
to 1l significant figures over a span of 100 time steps. The 
values of the density in the steady state at the grid points 
together with the grid points themselves are given in Table VI. 


Table VI 


1.865 1.885 
1.954 1.803 
1.770 
1.969 
1.703 


2.429 
3.375 
2.965 
3.108 


.691 
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Note that the raw data in Table VI seem to indicate roughly 


the same y-shock location at Xo =1, x= -961, and x3 = .853, 


namely between the grid points Vigne -3086 and Ys = .500. This 


Means that because of the coarse Tchebyshev grid the shock 
location cannot be resolved to better than 20% of the domain. 
In fact, the correct shock locations at those x-stations are 


y = .434 for pe Cai: +417 for x) and y = .370 for X3. 


In the present case it is not necessary to employ a 
saw-tooth piecewise smooth function, as was done in the 
previous section, because there is no need to preserve 
periodicity. Instead, we subtract from the oscillatory data 
an expansion of the Heaviside function, S(y,y,) $ 


where a, the state ahead of the shock, and ay, the magnitude 


of the discontinuity, are constant. The description here of 
S(yry.) , as if independent of x, has to do with the fact that 


the two-dimensional results of the pseudospectral algorithm 
were post-processed at fixed x stations. The expansion of 
S(y,y,) is given by 


N 
Sy(yry.) = E A) (s)T, (y) (4.14) 
N s 20 2 LA 


where T (y) is the Tchebyshev polynomial of order L, 


Ta (y) = cos[2% cos *(y)], and 


l 
A, (s) (s + 2 Jax 
me tas alee 1, n TA 
A(s) = sin Ẹ (s + bJa sin 3p? l $£ s N-1 


Ay (9?) = sin [is + d] 7m. 
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If s is an integer, then on the grid points, Yj = cos (mj/N), 


SKYY) = Slyjry,)- (4.15) 
The Lj-norm which we wish to minimize is now, at any given 
x-station: 
N p<N 2 
u= z + fo.) -4 Yaya) Z Pe Ty (yy) 
j=0 ej l fag | 1 3 2=1 
(4.16) 
l 1 £ j < Nel 
Ante (4.17) 
J 
2 JOEN 


Differentiating (4.16) with respect to the parameters a; ao, s 
and bl $ & s p < N), using the orthogonality relations for 


$ 


the Tchebyshev polynomials and manipulations similar to those 
used in the previous section, we get p + 3 nonlinear algebraic 


equations which are completely analogous to (4.5) - (4.8). 
They are: 
by R Po r a, Ay 2 = 1,2,.007P- (4,18) 
ees Mi = 4. 
fo a, Ay ay o (4.19) 
N z N 2 
> c, A, P Gi r c, A =O (4.20) 
E ANE NEN EE W 
N pt N 
> P-a E oA AP =O (4.21) 


L=ptl L=ptl 
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where 
N 
5 2 ik 
P, =—— IE — oly.) (y.) (4.22) 
2 Ne, jæ 5 j ~j 
Age ANA (8). (4.23) 
ask 


Again, we combine (4.20) and (4.21) into a single nonlinear 
equation for the shock location index, s: 


Beila = ic Ic A, AZ =O (4,24) 


Bey Ay Pg Eee Ay a Aa Py By Ay AQ 


where all the sums are from 2 = ptl tok =N. 


The procedure for extracting the shock location, jump 
magnitude and smooth part of the solution from the raw data 
P (x,y,) (given in Table VI) is exactly the same as described 


above for the Fourier problem. 


For the wedge-flow problem considered here, this procedure 
applied in the case of a coarse net (N = 8), located the shock 
with an error only in the fourth significant figure. The 
smooth part was recovered to within 1% at the worst field point. 


5. CONCLUSION 


We have demonstrated that the realization of a numerical 
solution via its grid-point value may be misleading when the 
true solution has a complicated structure which is not 
resolved by the grid. We have shown, however, that the 
numerical solution does contain highly accurate information 
about the solution and we suggested two ways of extracting 
this information, 


The analysis outlined in this paper is a linear one 
(though the procedure was applied also to nonlinear problems). 
However, Lax (1978) has argued that more information about the 
solution is contained in high resolution schemes even in the 
nonlinear case. In fact, using notions from information theory, 
Lax has shown that the €-capacity of the set of the set of 
approximate solutions is closer to the €-capacity of the set of 
the projections of exact solutions if the numerical scheme is 
a high-order scheme. 
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In the area of digital filters one always processes the 
data in order to overcome the Gibbs phenomenon. If we look 
at the initial conditions as an input signal and at the final 
result as the output signal, the idea of filtering is a 
natural one. 
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THE NUMERICAL SOLUTION OF BIFURCATION PROBLEMS WITH SYMMETRY 
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A.D. Jepson 
(Department of Computer Science, University of Toronto) 


and 


A. Spence 
(School of Mathematics, University of Bath) 


1. INTRODUCTION 


The steady-state behaviour of many physical systems can often 
be modelled by non-linear multiparameter equations of the form 


F(x,4,a) =O F:XxRxRP >X (1.1) 


where F is a smooth non-linear function, X is a Banach space, 
xeX is a state variable, \eR is the bifurcation parameter, and 


eR? (p 2 1) is a vector of control (or auxiliary) parameters. 
The distinction of the bifurcation parameter ìà is made for two 
reasons. First it is often the case that one parameter is 
"dynamical" in character and is varied quasistatically, while 
other parameters iare "geometrical" and:are less easily varied. 
Second the distinction of à is numerically convenient since it 
allows the use of the singularity theory of Golubitsky and 
Schaeffer (1985). (We remark that a good understanding of the 
solution set of (1.1) is often a prerequisite for understanding 
the solution behaviour of the time-dependent problem 


x, = F(x,A,@) , 


though we do not discuss this aspect any further in this paper). 


As an example (1.1) could be the system (3.13) in Cliffe and 
Spence (1985) arising from a finite-element discretization of 
the Navier-Stokes equations for steady axisymmetric flow in the 


Taylor problem. In this case X = Rae x represents the primitive 
variables velocity and pressure, à is the Reynolds number, and 


there are two control parameters a = (araa), with a the 


aspect ratio and a, the radius ratio. (N.B. in Cliffe and 
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Spence (,a) = (R,I,n)). The numerical strategy described in 
this paper was tested on the Taylor problem and numerical 
results are given in section 5 (see Fig. 4). In the experiment 


of Benjamin and Mullin (1981) the radius ratio a, was fixed at 


a constant value and so direct comparison with experimental 
results is not possible. We note however that numerical 
results for the case of a, fixed are given in Cliffe (1983) and 


Cliffe and Spence (1984) and show excellent agreement with the 
experimental results. As is standard practice Benjamin and 
Mullin describe their results in terms of bifurcation diagrams. 
They give graphs of solutions of the Navier-Stokes equations 
against R for various values of I, with n constant. Three 
such graphs are given in Fig. l. 


UZ (%,0) 


Ly 


(a) 


(b) 


(c) Cig 


Fig. 1 Bifurcation diagrams for the finite Taylor problem for 
various values of aspect ratio: a) a, = 1.24, 


b) a. = 1.26, ©) a, = 1.27. 


l l 
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Note that though the solution is x = (ua, uy? we p) a 


representative component of the solution (in this case 
u, (3/4, 0)) is used along the vertical axis. Because of the 


reflection symmetry in the problem (see Cliffe and Spence 
(1985), section 4) the two-cell flows have u, (3/4,0) = O whereas 


non-symmetric solutions have u, (3/4,0) + O. The point at which 


the two-cell (symmetric) flow changes to a non-symmetric flow is 
called a symmetry-breaking bifurcation point. Because of the 
non-unique nature of the solution set near a bifurcation point 
it is straightforward to show that 


F(x, A/a) is singular at a bifurcation point (x,A,a), (1.2) 


since, otherwise, the implicit function theorem would guarantee 
uniqueness. This is a key property and we develop the theory 
in detail in section 2, though we note here that if (1.2) holds 
at ario 12) we say that (x15 18) is a singular point. 


The overall objective of this paper is to describe an 
efficient systematic procedure for the numerical solution of 
non-linear multiparameter problems. Continuation methods 
(Keller (1977), Rheinboldt (1980)) have been developed to 
compute ‘bifurcation diagrams for problems of the form 
F(x,A,a) = O with a given. However these techniques require a 
starting point on each connected component of the bifurcation 
diagram and are inefficient when applied to multiparameter 
problems. Here we describe an approach which alleviates these 
two difficulties. The approach is based on combining standard 
continuation methods with the singularity theory of Golubitsky 
and Schaeffer (1985). This approach has been used successfully 
on non-symmetric problems, see Jepson and Spence (1985a), and 
this paper is an extension of that work to problems with 
(reflection) symmetry. An expanded version of this work 
containing proofs and further details is in preparation. 


The following example, given by Golubitsky and Schaeffer 
(1985), Ch.VI, illustrates several important concepts for the 
case n = 1, p = 2 in (1.1). Consider: 


3 
F(x,A,Q): = ae - one + a + a,x - ax m > 1s (1.3a) 


= xa(z,A,a) , z= tee XER, (1. 3b) 


where 
RR xR ROR: 
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In Fig. 2 the varieties Boe By and ibs are shown to divide the 


control parameter space into 5 open regions labelled (1)-(5). 
These varieties are given by 


Boir {acer a(0,A,2) = a, (0,A,a) ‘= 0} (1. 4a) 


2 


B faeR": (i) a(0,A,a) =0; and 


1 


0, x +0} (1.4b) 


(Gi) F(x,A,a) = PY (x,A,8) = F, (x,2,a) 


nE {aer”: a(0,A,a) = a, (O,A,a) =O} . (1.4c) 


Fig. 2 Control-parameter space for (1.3a). 


The qualitative shape of the bifurcation diagrams of (1.3) is 
the same for all values of & in any one region (1)~(5). These 
shapes are sketched in Fig. 2, and we note that bifurcation 
diagrams similar to those found in the Taylor problem are 


recovered if we vary ai and a) > 0. (Follow the dashed line in 
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Fig. 2 as Oy increases through zero). Intuitively we say that 


two bifurcation diagrams are qualitatively similar if they 
have the same number, type and arrangement of singular points. 
This notion is made more precise in Golubitsky and Schaeffer 
(1985). 


The two main objectives of our approach to solving (1.1) can 
now be described. First we aim to completely decompose the 


control parameter space RP into regions for which the 
bifurcation diagrams for (1.1) are qualitatively similar. This 
Means that we need to compute the varieties which divide that 
space (see, for example, B, B and E in Fig. 2). Second we 


aim to provide initial points on each connected component of 


the bifurcation diagram for any given aer?, In this paper we 
discuss the symmetry-breaking case only and so we do not 
concern ourselves with varieties like BL in Fig. 2. 


The plan of the paper is as follows. In section 2 we give 
the theory of singular points in the presence of symmetry. In 
section 3 we describe the singularity theory of Golubitsky and 
Schaeffer which we need for our numerical approach. In section 
4 we briefly outline how systems for singular points of (1.1) 
can be derived from the theoretical results in sections 2 and 3. 
Finally in section 5 we outline the numerical approach and give 


2 
a decomposition of control parameter space R° for the Taylor 
problem. 


2. SINGULAR POINTS IN THE PRESENCE OF SYMMETRY 


In this section we give some theory of bifurcation points. 
As was mentioned in the introduction a necessary condition for 
(x54 Agr) to be a bifurcation point of (1.1) is that 


F.: =F (x_,A_,a.) : xX*X is singular. If this holds then 
x 2707 0ng 


(% 72519) is said to be a singular point, otherwise it is a 
regular point. Whether or not bifurcation occurs at a singular 
point depends on certain conditions on the derivatives of F at 
(x, oto) : typically there will be certain defining conditions 


and some associated non-degeneracy conditions. Throughout this 
paper we shall consider only simple singular points, i.e. 
points (x51 Ào to) satisfying (1.1) and such that 


Null Ce) = span{¢,}, Range (Po) = tyex:y,y = 0, Vex") (2.1) 
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n n 
where X' is the dual space of X. (If X =R , then X' = R and 
the usual dual pairing is given by the inner product voy) . 
Thus, under (2.1), the defining conditions 


o P 
Pog =O , Xo to eX AER” aR, (2.2) 
2$ l0, 


(here 2£ex' and £g -1=0 is simply a normalization of the 
eigenfunction $) with the non-degeneracy conditions 


o o 
YF FO, YF toto tO (2.3) 


characterize a simple "quadratic" turning point, which is the 
most likely type of singular point for a one parameter problem 
without symmetry (see, for example, Spence and Werner {1982)). 
In this paper we consider the case when the equations satisfy 
a reflectional symmetry condition which we represent by the 
following common assumption: 


There exists a linear operator S on X satisfying S + I, 
ai = I, and 
F(Sx,A,a) = SF(x,\a) . (2.4) 


For example, in Cliffe and Spence (1985), F represents a 


variational form of the Navier-Stokes equations, 


pS) wir? 


x (p)? sak: (D) and is given by (4.1) in that paper. In 


the finite-element discretization, x =, R” and S is the matrix 
given by (4.16). 


The symmetry condition (2.4) has several important 
theoretical and numerical implications, though we discuss 
mainly the theoretical aspects here. We refer the reader to 
Cliffe and Spence (1985) for some discussion on the numerical 
implications. 


The following description is based on that in Werner and 
Spence (1984). The operator S induces a decomposition of x 
into 


x= xX, @x, (2.5a) 
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where 
x_ = {xex : Sx = x} , X = {xex : Sx = -x} (2. 5b) 


consist of the symmetric and anti-symmetric elements of X 
respectively. An immediate consequence of (2.5) is that for 
xEX_, 
sS 
(a) F, Fas Faj E€ ht , 


(b) x_ and X_ are invariant with regard to F , F,, F $ 
s a KOEN a, (2.6) 


(c) for vV; wex. or v, wWEX Fix wex, , 
(d) for VEX” weX Fax vwex, r 


where F: = F(x,\,a) etc. Also it is easy to show, using (2.6b), 
that if x eX_ then ¢ e€X_ or ¢ €X_. Of most interest is the 
o's o's ova 
latter case, the so-called, symmetry-breaking case, 
‘ 
i (2.7) 
x eX, , o0eX, Ea 


It follows that 


1e 2.8 
yx = 0 for all xex, (2.8) 


and hence, using (2.6a) and (2.6c), 


Og o = s (2.9 
ee CA YE oro a ) 


Clearly, under (2.4) and (2.7), we do not expect to see 
quadratic turning point singularities since the non-degeneracy 
Conditions (2.3) are automatically violated. In fact the 
generic type of singular point found when (2.1), (2.4) and 
(2.7) hold, which is the "quadratic" symmetry-breaking 
bifurcation point shown in Fig. 1, is characterized by the 
defining conditions 


=O 
F(x AG) , 


a RP 2.10) 
BL (xj Agra), =O 1 MEX n EX AER, GER” , ( 


Lp aie. , 


(note the changes in domains from (2.2.)) and the non- 
degeneracy conditions 
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o o o sitter) 
VoF ato EO AE FO Fv, = Fle vex, (2. 11a) 


o o G O 
VPrxxtototo + VEe FO P= E Ite W EX | (2-11b) 


where Vo and Wọ are well defined because of (2.9). (We remark 


that we show in section 4 how such systems and non-degeneracy 
conditions are derived using the theory in section 2.1 and 
section 3). 


An important theoretical tool for analysing the solutions of 
non-linear problems at and in the neighbourhood of a singular 
point is the Liapunov-Schmidt reduction Process. This is a 
general process by which a problem like (1.1), of possibly 
infinite dimension, is reduced to a problem of small dimension, 


namely, the dimension of Null (Fo) . In our case under 


assumption (2.1) the problem (1.1) will be reduced to an 
equivalent scalar problem, and such problems can then be 
analysed using the singularity theory in Golubitsky and 
Schaeffer (1985). 

2.1 The Liapunov-Schmidt reduction with symmetry 


Under assumptions (2.1) F is a Fredholm operator of index 
zero and therefore induces natural decompositions of X into 


x= x, ® null (Fo) X= Range (F9) er, (2.12) 


where x, and YX, are closed. Let P and Q be the projections 


P:X*K R(P) = N(FS) 7 N(P) =X, (2.13a) 
Qax RIO = ¥ NUO) = Range (F°) d (2.13b) 

Note that 
dim N(F°) = dim Piel me (2.14) 


The key result, on which the reduction process depends, is that 


ho} 
F r x > Range (FD) (2.15a) 


is invertible (see Schechter (1971)), and hence 
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(1-9) F° : X] > Range(F°) (2..15b) 


is invertible, since I-Q acts as the identity on Range (Fe) a 


The reduction process proceeds as follows: 


By i 
n a neighbourhood of (xr dorto) 


1 


(1) write x x, + (I-P)x + Px 


o 
= + 
x, + x, x, A X)EX)1 XEN (FY) 8 (2.16) 


(2) Write F(x,,a) = O as 


(I-Q)F (x, +x, + Xe A, a) =O€ Range (F?) (2.17a) 


1 


QF (x, +x + Xe A, a) =Oe Yz (2.17b) 


1 


(3) Using (2.15b), we can solve (2.17a) for x, in terms of 
Xp and a using the Implicit Function Theorem, i.e., 
x= x (Xy 11) (2.18) 
and substituting this into (2.17b) gives 
OF (x, ha xy (xp rÀ ra) +X 4, a) =O, (2.19) 


which is a mapping from N(Fo) x Rx RP to Y. 


(4) Finally it is convenient to rewrite (2.19) in co-ordinate 
form using 


Oy) 
N(FY) a et, , 
where £ is real, and 
Qz = O implies vz =0. 
Thus (2.19) becomes 
h(e,A,a): =ŅF(x + x (eo, àa) + EG 1A 1a) =O (2.20) 


under the above assumptions. 
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Thus (1.1) is equivalent to (2.20) and information about 
the solutions of the scalar problem (2.20) can be converted 
back, via. (2.17)-(2.19) to information about solutions of 
(1.1). We merely note the following results which are 
readily proved by differentiation and evaluation at (¢,A,a) = 
(0,.5.9,) (see Golubitsky and Schaeffer (1985) ) 


hs (07A 2o) =0, (2.21a) 


which follows immediately from FAS =o, 


hoe OA) = v Forto (2.21b) 
hice ENSS ki WF x oF Lee oF VF rcxtototo (2.210) 
h (0A ra) = VFS (2.214) 
hye (Orda) = VELL, + VF oto (2.21e) 
where 
2 
*lee TI% ron ears ; 


Finally in this section we note that if F satisfies the 
symmetry property (2.4) then it is straightforward to show 
from (2.18) 
% = * (6b 2A) = = x (-e¢ àra) 
and that 
h(e,A,a) = - h(-e,h,a) , (2.22) 


i.e. the reduced problem on X = R satisfies the symmetry 
condition (2.4) with S = -1. Also using (2.9) and (2.21b), 
(2.21d) we have 


hy 0) homo, (2.23) 


and one readily shows that Xce = Zo given by (2.11b) and 


Xia = Ye given by (2.lla). Hence conditions (2.lla) and 
(2.1lb) are precisely hey + O and Roce + O. We discuss these 


links further in section 4. First we have to analyse problems 
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like (2.20) which satisfy (2.22) and this is done next in 
section 3. 


Remark 2.24 


We note that if the symmetry condition (2.4) does not hold 
then it can be readily shown using (2.2la), (2.21b) and (2.214) 
that conditions (2.2) and (2.3) which characterize a quadratic 
turning point are equivalent to 


h(e,A,a) = 0 
h (€7,02) =O 


with h, (€,A,a) + O, ho, (€740) + O. This subject is discussed 
more fully in Jepson and Spence (1985b). 


3. SINGULARITY THEORY WITH Zo -SYMMETRY 


In order to achieve the objectives laid out in the second 


last paragraph in section 1 it is clear that we need a 
computational method both for the calculation of the varieties 
which split up control parameter s ce (for example, HA and BS 
in Fig. 2) and for recognizing and calculating any exceptional 
points on these varieties (for example, O, in Fig. 2). In this 
section we describe the singularity theory which gives our 
numerical approach its theoretical basis. The material is to 
be found in Golubitsky and Schaeffer (1985) and Golubitsky and 
Langford (1981). We omit all proofs in this section; they will 
appear in a later publication. 


It was shown in Section 2.1 how problems like (1.1) 
satisfying the symmetry condition (2.4) might be reduced, under 
appropriate assumptions, to scalar problems of the form 


h(x,A,a) =O, h:RXRXR? >R, (3.1) 
h(-x,A,a) = -h(x,A,a). (3.2) 


(Here we use x instead of £ in section 2.1). Problems 
satisfying (3.2) are said to have Z symmetry and can be 


rewritten as 


h(x,Aya) = x a(zApa) 22x, (3.3) 


i.e. a is evenin x. It turns out that (3.3) is the most 
convenient form for study. 
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First consider problems of the form 


h(x,A) =O, h=RxR+R, 
(3.4) 


h(-x,A) = -h(x,A) , 
with the aim of classifying the Z,-symmetry-breaking 
singularities of h(x,\) = 0 i.e. the points (0,2) where 
h, (0,4) = O (recall (2.2la)) and where bifurcation from x = fe) 


takes place. (Note that for the case x = R, S = -l, then 
Xs = {0} since, using (2.5b), Sx = x implies x = 0). In 


Golubitsky and Langford (1981) the notion of Z3 


used to classify the singularities of (3.4) into equivalence 
classes. 


-equivalence is 


Definition 3.5 
2 
Let h, g: R > R be two smooth (i.e. c’) functions with 
Z,-symmetry, and suppose that h(0,A,) = 0, g(0,0) =O. The 
two bifurcation problems 


h(x,A) =o, 
g(x,A) =O, 
are said to be Z3 ~ equivalent if 
g(x,A) = T(x,A) h(X(x,A) ,A(A)) (3.5a) 


where T is even in x, X is odd in x, and 
i ax 
T(0r0) +O, 5x (0,0) $0, xX(0,0) =0, 


JA (3.5b) 
px (0) $0, Alo) = Se 


Associated with each equivalence class is a number, called the 
%,-codimension, which, if finite, shows that each member of the 


equivalence class is rs equivalent to a polynomial canonical 


form. Also bifurcation diagrams for problems in the same 
equivalence class have the same qualitative behaviour as that 
of the canonical form. The equivalence classes are referred to 
Simply as "singularities". as in Jepson and Spence we 
introduce a graph, which we call the Z -hierarchy of 


singularities, in which are arranged the singularities of 
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Z,-codimension less than 4. The graph, Fig. 3, is structured 


to emphasize the relationships between the singularities and 
to allow a clear description of our numerical approach. The 
(q,j)-singularity is defined to be the singularity which the 
(q,j)-node has at. (0,0). Note that q takes the values 
* 
0,1,2,3 ,3 where for rather technical reasons we distinguish 
tet 
one codimension 3 singularity with a superscript and create 
an extra level between q=2 and q=3 in the graph for this 
singularity. Throughovt this paper O $ q < 3 is taken to 
* 


include the case q = 3. 


xe 2bAx + EN 
E=21 b’$ € 


Fig. 3 Hierarchy of Z,-singularities 


An important concept is the stability of a singularity 
with respect to Z5-symmetry preserving perturbations. 


Definition 3.6 


A singularity of h(x,A,a) = O satisfying (3.2) at (0,45 12) 
is said to be structurally stable if for any smooth r(x,\,a) 
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also satisfying (3.2), for any neighbourhood V of Aso and 
for any 6 with |6| sufficiently small, the perturbed problem 


h(x,A,a) = h(x,Aya) + 6x (x,2,a) (3.6) 


has a singularity of the same type at (0,i(6) 7a(6)), 
(A(5),a(6))ev, (i.e. h is 2,-equivalent to h for ô sufficiently 


small). If in addition (A(5) ,a(6)) = (A (0) „a (0)) + 0(6) the 
singularity is structurally linearly stable. Clearly in 
numerical’ calculations on a physical problem one can only 
expect to compute structurally stable phenomena and this is 
reflected in Theorem 3.14 below, which states that if a 
singularity is structurally stable then we can construct an 
extended system which has an isolated solution at the 
singularity. The Z,-hierarchy is designed such that a 


suitable extended system for a (q,j)-singularity can be 
constructed by following a path down the hierarchy to the 
(q,j)-node and requiring that the label on each branch of the 
path vanishes at (0O,A,a). We assume that the path starts 

above the (0,1)-node and so the equation a = O is always 
included. Thus the extended system for a "quadratic" pitchfork 
bifurcation point, (q,j) =(0,1) is 


A 


pantas, (3.7) 
, 


(cf. Jepson and Spence (1985a), where symmetry is not assumed), 


* 
and the extended system for the (37,2) singularity is 
Aly 13) = (aja aye Oo, (3.8) 
3,2 Ze PA 
As in Jepson and Spence (1985a) we associate non-degeneracy 
conditions (side-constraints) with each (q,j)-extended system. 


These are the conditions, of rid k = 1,2,...K, that the labels 
, 


on the branches leaving the (q,j)-node are non-zero. Thus for 
the (0,1) -singularity we have 


hy apa 2 
rma to, ci Ta toh (3.9) 


The following theorem, which is an immediate consequence of 
proposition 3.47 of Golubitsky and Langford (1981) shows that 
a (q,j)-extended system with appropriate non-degeneracy 
conditions completely characterizes a (q,j) - singularity of 
(3.3). 
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Theorem 3.10 


With O < q < 3, assume that 


A ,(0,A,a) = 0 ert (3.10) 
aj vi 
is a (q,j)-extended system derived from the Z-hierarchy and 
that 
4 (0,A,a) FO k = 12,...,K; (3.11) 


are the appropriate side constraints. Then (0,A,a) is a 
solution of (3.10) satisfying (3.11) if and only if (0,A,a) is 
a (q,j) singularity of (3.3). 


The question arises as to how one might compute such 
singular points. Obviously one must assume that in a physical 
problem any singularity is structurally stable, which 
essentially means that the control parameters in (3.3) enter 
in such a way as to include all possible Z -perturbations of 


(3.3). These concepts are made precise using the concept of a 
%,-unfolding. Briefly, let g(x,A) be a bifurcation problem 


with Z-symmetry, then h(x,\,a) is a p-parameter Z,-unfolding 
h(x,A,B8,) and h satisfies 


of g(x,A) near (0,4, 7B) if g(x,A) 
(3.2). If h and f are Z,-unfoldings of g then h factors 


through f if h(x,A,8) = T(x,A,B) £(X(x,A,B) ,A(A,8) »M(B)) where 
M(B.) = Bo and the usual Z,-equivalence conditions (3.5b) hold. 


h is a universal Z „unfolding of g if all unfoldings of g 
factor through h. The Z,-codimension of g is the number of 
parameters in any universal unfolding. The following theorem 
follows immediately from the definitions. 


Theorem 3.13 


Let (07A rto) be a (q,j)-singularity of h(x,à,a) = O and 
suppose h is a Z,-universal unfolding. Then this (q,j)- 
singularity is structurally linearly stable. 


* 
We can now discuss briefly the situation about the (3 ,2)- 
singularity. The difficulty is explained in Golubitsky and 
* 
Langford (1981) after Theorem 3.19. The (3 ,2)-singularity, 
x Pome + ee O, i + 1 has Z,-cođdimension 3 bécause of the 
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smooth tc”) requirement on the equivalence relation (3.5a). If 
we allow only topological equivalence (i.e. the equivalence 
only involves continuous functions) then the singularity has 
(topological) Z= codimension 2. This is in fact a deep 


theoretical problem concerned with moduli (which we do not 
discuss here) but numerically is quite easy to cope with as we 


* 
see in section 5. Essentially we think of the (3 ,2)- 
singularity having codimension 2. 


Finally we have two very useful theoretical results from the 
computational point of view: 


Theorem 3.14 
With O $ q $ 3 < p, assume (0 rto) is a (q,j)-singularity 
of h(x,A,a) = a(z,A,a)x, z = om Let AG jOa be the (q,3)- 
' 
extended system obtained from the Z,-hierarchy. If h(x,A,a) is 
a universal unfolding of h(x,,0,) then 


oA 
Gri: OS) è 
MES ry) (ogo)? qtl 
oA 
pie Co BG 
i.e. EAN has full rank. 


Theorem 3.15 


Let 1 ¢ q ¢ 3 and suppose (0,05 1%) is a linearly stable (q,j) 


singularity of (3.3). Let j' be chosen so that (q-1,j') is 
a node in the hierarchy above the (q,j) node. Then (5045185) 


is a solution of 
A '= 
q-l,j g 

with 
3 


A ' 
rank ( = 


əd (Àa) 


The importance of Theorems 3.14 and 3.15 is explained in 
section 5. 


(aroo)? =q (i.e. full) . 


4. EXTENDED SYSTEMS FOR (1.1) 


In this section we briefly give the extended systems for 


problems like (1.1) which are derived from the extended systems 


A, j in section 3. We only give those needed in the Taylor 
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problem. 


| Recall that in section 2.1 we showed how for simple 
singular points problems like (1.1) could be converted into 
scalar problems like (2.20) via the Liapunov-Schmidt reduction 
process. This process is reversible and it is straightforward 
to show that the extended system for the (O,1)-singularity 


(Z,-equivalent to ea given by (3.4) with non-degeneracy 
conditions (3.9) is equivalent to the system (2.10) with 
non-degeneracy conditions (2.11). 


The extended system A. = (a,a,) = 0 for (01a) to compute 


1,1 
a (1,1)-singularity (Z-equivalent to aasia) is equivalent to 


&o-1 = 0, (4.1) 
Fao t Faboty = 


Pia, + F° e z+ 1/3 Fe xtototo = 


with unknown (x ‘ot Por AE Oa (a, ) m € x, xX XRXX S xX xR , 


and the extended system Ay 27 (ara) = 0 for (Ara) to compute 
x Siz 
a (1,2)-singularity (2, - equivalent to x -\ x) is equivalent to 


Fin 0, 


x"o 
£o-1 = 0, (4.2) 
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with unknown (x01 toro YoYo" (CFRE) € X 2X RX 1X R. Systems 


(4.1) and (4.2) were first given in Cliffe and Spence (1984) 
where the (1,1)-and (1,2)-singularities were called quartic 
pitchfork and coalescence points respectively. 


It should be clear now how to derive the extended system 


* 
for a(3 ,2)- singularity in (1.1) which is equivalent to 
Are (ara,ra)) = 0. Such a system consists of adding the 


3 ,2 
last three equations of (4.1) to (4.2). 


Fuller details of the various possible extended systems for 
singular points are given in Jepson and Spence (1985b). We 
merely note that numerically useful theorems like 3.10, 3.13, 
3.14 and 3.15 can be proved for the extended systems like 
(2.10) (4.1) and (4.2). 


5. NUMERICAL APPROACH AND RESULTS FOR THE TAYLOR PROBLEM 


The numerical approach is essentially the same as that 
described in Jepson and Spence (1984), (1985a). Here we give 
a brief summary and present some numerical results for the 
Taylor problem. We describe the technique for a problem like 
(3.1). The extension to problems like (1.1) is clear. 


Step 1 Descending the hierarchy 


We begin by trying to locate the points of highest 
codimension in a given problem like (3.1). To do this we 
descend the hierarchy as follows. Assume at some stage we 
know a solution (Orra) of the (q,j)-extended system 


Ag, 5 (OrAra) = Ag 5 ZEO (5.1) 


where z = (0,i,a), B = = (gears Sp) with a= 


Sgt! N 
(oy res rag) the unfolding parameters ßBeR and yer? I1 called the 


slice parameters. For the slice parameters held constant, 
(5.1) may be viewed as a one parameter bifurcation problem with 
state variable z, and bifurcation parameter B. From theorem 
3.14 we see that (0,A,0,) corresponds to a regular point of 


(5.1) and hence paths of such points can be computed using 
standard continuation algorithms. Also it is important to note 
that Theorem 3.15 implies that the extended systems are well 
behaved at codimension q+l singularities. In particular we can 
expect only simple turning points in (5.1) as 8 varies. 
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Singular points of codimension q+l on this path are 
recognized by locating a range of values of 8 at which a 


nen-degeneracy condition œŒ changes sign. It may be computed 


qj 
using the appropriate extended system and the process of 
following paths of such (q+1)-codimension singularities is 
repeated. In this way the hierarchy is descended until the 
singularity with the highest codimension is located. 


Step 2 Ascending the hierarchy 


Given a (q,j)-singularity (z,8,y) that is a regular point of 
(5.1) the point is certainly a solution of 


Beenie amt erp) 2.0 (5.2) 
where the (q-l,j')-nođe is a node in the hierarchy directly 
above the (q,j)-node and (z',ßB') = z, y' = (8,y). Then theorem 
3.15 shows that (5.2) has, at worst, a simple turning point at 
the (q,j)-singular point. Standard continuation methods can 
now be used to compute solution branches of (5.2). 


Step 3 Obtaining initial points 


By unfolding all the singular points of codimension 2 1 we 
can obtain all the quadratic pitchfork points and turning 
points and hence obtain starting values on each connected 
component of the bifurcation diagram of (3.1) which contains 
at least one singular point. 


Calculations were done on the Taylor problem described in 
section 1, some numerical results for the case of %3 fixed have 


already been reported, Cliffe (1983) and Cliffe and Spence 
(1984), and these numerical results agree well with the 
experimental results of Benjamin and Mullin (1981). We present 
here, in Fig. 4, a plot of the control parameter space where 


oS aspect ratio and a = radius ratio. The dashed line 
corresponds to the results for a, = 0.615, the value in the 


Benjamin and Mullin experiment. In Fig. 4 we only give the 
varieties He and Bo: Variety Ho is the path of (1,1)- 


singularities (quartic bifurcation points) computed using (4.1) 
and letting a, vary, and By is the path of (1.2) - singularities 


(coalescence points) computed using (4.2) and letting a5 vary. 


These paths touch tangentially at O as predicted in Fig. 2, 
which gives the control parameter space for the unfolding of the 


canonical form of the (3*,2) singularity. The point O is 
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recognised on fs because the side constraint ay +O 


(equivalently (2.1la)) is violated. Finally we note that for 
the Taylor problem we can derive a condition equivalent to 
m> 1 in (1.3a) and this can be confirmed numerically. 


0-9 


08 


0-7 


0-6 Á 
10 14 12 1st 


Fig. 4 Control-parameter-space plot of solutions for the 
finite Taylor problem. 
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NUMERICAL CALCULATIONS OF BIFURCATIONS 
IN THE FINITE TAYLOR PROBLEM 


K.A. Cliffe 
(Theoretical Physics Division, AERE Harwell) 


and 


A. Spence 
(School of Mathematics, University of Bath) 


1. INTRODUCTION 


The Taylor problem of flow between concentric circular 
cylinders has been studied for over 60 years. Despite this 
intensive investigation it is still producing new phenomena 
which are not yet fully understood. In this paper we shall be 
concerned with the application of recently developed techniques 
in numerical bifurcation theory to the steady, axisymmetric 
flows possible in the Taylor experiment. The experimental 
situation we have in mind consists of two concentric circular 
cylinders. The inner cylindrical wall rotates and the outer 
cylinder and both ends are stationary. The annular gap 
between the cylinders is filled with a fluid and it is the 
motion of this fluid that is studied. One of the ends consists 
of a moveable annular collar so that the length of the annulus 
may be adjusted. Benjamin and Mullin (Benjamin, 1978; Benjamin 
and Mullin, 1981; Mullin, 1982; Mullin, Pfister and Lorenzen, 
1982) have carried out several experimental investigations of 
flow in the above apparatus and have discovered an interesting 
variety of bifurcation phenomena. 


From the point of view of bifurcation theory Benjamin and 
Mullin's apparatus has several salient features. Firstly the 
equations governing the steady motion of the fluid are non- 
linear; secondly the problem has two parameters which may be 
adjusted, namely the speed of the inner cylinder (in non- 
dimensional form the Reynolds number, R) and the length of the 
annulus (in non-dimensional form the aspect ratio, T); finally 
the apparatus is mirror symmetric about the midplane of the 
annulus. (The apparatus also possesses axisymmetry about the 
axis of the cylinders, however all the flows considered in this 
paper preserve this symmetry and so it has no effect on the 
structure of the bifurcations). 
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The approach we have adopted in studying this problem is 
to discretize the governing boundary value problem by 
the finite-element method. This leads to a finite dimensional 
set of equations with two adjustable parameters R and IT. The 
equations also possess Z3 or mirror symmetry. The solution 


structure of these equations is determined by applying numerical 
methods for two-parameter problems with Z,-symetry. (Jepson 


and Spence, 1980; Werner and Spence, 1984; Cliffe, Jepson 
and Spence, 1985). 


The rest of the paper is as follows. In section 2 we 
present the governing equations for the problem and describe 
the finite-element discretization of these equations in section 
3. In section 4 we discuss the implications of Z,- symmetry. i 


In section 5 we describe the various systems of equations that 
can be used to compute singular points in problems with two 
parameters and Z -symmetry. Finally in section 6 we illustrate 


the use of these techniques on a problem in the Taylor 
experiment (Cliffe, 1985). 
2. GOVERNING EQUATIONS 

The equations describing the flow of an incompressible, 
viscous fluid in the Taylor experiment are the Bere sacokes 


equations. In cylindrical polar coordinates a Palen ) with 
origin midway between the ends of the annulus and velocity 


x xk * 
(a magn ) the equations for steady, axisymmetric flow are: 


ðu ðu Tu 


>ar HEEL ge 
RIRES x te az rep * i ðr 


r ðu, du. ðu, 2ru 


EE EE, r pee (LEENE, EANTA j 
(eee) ae 748) ae a at pay oe, 62>) 
(r+8) 


du du wu 
Rusa, aba pee, 
ne z oz (x+8) | 


2 | 
ðu ð u Tu 
sD OF Sa E ET AET fen LES 
ze ar FTP) ar T 2+ Bo wD hae A2) 


az (r+8) 
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du, du, ap 
R (Tu, or hs uz 3z E əz 
2 
ðu ðu, au 
E A P 4g VE PAE gy oe anne is 
EREN ns Te ee 
du 

ee pea A 
(r+ß) Ox (Erp) art gene oe (2.4) 


In the above equations r, z, (a, Uge u) and p are given by: 


* * tas z v4 * 
u_,u,,u 
eels molec AAE TERN ee aiSRiy. 
rea B, z t , u pagra) r;a 1P ura ' 
and 
oh aaa a ite 
Le DAA ih it P 
where 


B= r,/a = n/(l-n) and n is the radius ratio; d= ror where 


ry and r, are the radii of the outer and inner cylinders 


respectively; £ is the length of the annulus; 2 is the 
angular velocity of the inner cylinder and v and u are the 
kinematic and dynamic viscosity of the fluid respectively. 


Equations (2.1)-(2.4) hold in the region 


D = {(r,z)| Osrsl , -0.5zg0.5} . (2.5) 


The boundary conditions are that u and u, are zero on the 
entire boundary of D, and that uy is zero on the outer cylinder 


(r=1) and one on the inner cylinder (r=0). On the ends 
(z=+0.5) Si is zero except near the inner cylinder where it 


increases smoothly to one over a small distance, €. The exact 
value of € and the variation of K will depend on the details 


of the experiment; however, we have found the results to be 
insensitive to the value of € and therefore conclude that any 
sufficiently small value will be adequate. We note that £ must 
be positive because when € = O the rate of dissipation of energy 
in the fluid becomes infinite (see Benjamin and Mullin, 1981, 
‘pp. 224, 225). 
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3. FINITE-ELEMENT DISCRETIZATION 


As the starting point for the finite-element discretization, 
equations (2.1)-(2.4) are converted into a non-linear operator 
equation in an appropriate Hilbert space. We introduce the 
following notation: let md (D) be Es “ei of functions which 
are square integrable over D; let wt 2(D) be the ae of 
ee whose generalized first derivatives lie in L? (D) and 
let "W 2 D) be that subspace of wh 2D) whose elements vanish 
PER on the boundary of D. wi!? (p)3 is the space of vector 
valued functions each component of which is in wi? (p) . 


Finally let H= ws" 20D 3 x 1? (p) + We introduce the following 


functionals: 
av. av, 
a(U;V/W,R,T,n) =R f [{(x+8) (Ta, ae + 8, ay —) -r uy Iw, 
ov av 
E -a 
+ {(x+B) (T Oras moses) to Be vo 
ey. av, 
+ {(r+8) (T ae trae IW IIa Gi 
w. ðu, ðu, ðw 
ét A 8 r t r 2) x 
b, (UW R,T yn) Ten Lene? Se ann ea + or l ge 
uw, ðu, dw ðu, ow 
TX Le o , Lere) “Me o 
Daa OO eee ae T az az 
uw du, du, ow, (x48) du, aw. 
EEIen | et ON ge eee a eee) 
aw, 
- Í [T p > T (r+8)w + p(r+6) TA 
ðu 


fitas Ł (x+8)u_ + s(x+8) = 7 (3.2) 
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where u = (u eae PP), V Vv = (v BEAL, 1q) and W = Myris) 


gto ,0)€H, where a, satisfies the 


boundary conditions on the azimuthal component of velocity, and 
let 


with U,V,WeH. Let U = (0,a 


b(U WRT yn) = aly WR Tn) + a(UsOWRT en) + by (PWR Tn), 
(3.3) 


c(WR Tn) = a(Gs0,WRT yn) +b (ÜW, RT) . (3.4) 


We now define the operator 
A:HxRxRxR?H , 


by 
AYR Tn) =W, 


where 


a(U;U,V,R,T,n) + b(U,Y,R,;T,n) + c(V,R,T,n) = <W,y> for all VeH, 
(3.5) 


and <.,.> denotes the inner product in the Hilbert space H. 
For fixed U the left hand side of (3.5) defines a bounded 
linear functional on H, which by the Reisz representation 
theorem, means W exists, and thus A is well defined. A 
classical solution of (2.1)-(2.4) satisfies 


A(U,R,T,n) =Q. (3.6) 


Solutions of (3.6) are called weak or generalized solutions and 
may, under certain smoothness conditions, be shown to be 
classical solutions of (2.1)-(2.4). 


We turn now to the finite-element approximation of (2.1)- 
(2.4) for each h>O er Wa ae M, be finite-dimensional spaces 
1,2 ' 
E = 
ES be aig, m? 1M a (D), and let May o WOW, (D) 
Let Ya = EE 100) € Ha (2W XM) + where By in approximates 
the boundary conditions on the azimuthal component of velocity, 


and let 


(Ui FV Wy R T,n) = aU; Vaip E T,n) for U Yp Yh" 
“heh shes th Yh ene eh 
(3.7) 
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bp pR T = aG rO WRT en) + alu, 70.0, RT yn) 


+ b) (UW, RT yn) for UW, eH ' (3.8) 


oR en) = al sO, RTN) + B(OH R/T yn) for W eB,- 


(3.9) 
The finite-element aprroximation of A, 
AB. xRXRXR>+ Ba 
is defined by 
A (URT) = Wh 
where 
ah UU Yp RT) + bp (Yh Yp RT) + Ch Yp RTN) = 
“W/V? for all y €H,- (3.10) 


The finite-element space W, is generated by none-node 


h,oO 
isoparametric quadrilateral elements with biquadratic 
interpolation and the space Mi is generated by piecewise linear 


interpolation on the same elements, the interpolation being, in 
general, discontinuous across elemert boundaries (Engleman et 
al., 1982). The parameter h is the length of the longest edge 
in the mesh. There is a natural basis associated with this 
finite-element approximation in which each coefficient 
appearing in any linear combination of the basis functions is 
either the value of the velocity or pressure or one of the 
first derivatives of the pressure at a node in the grid. We 
denote this basis by 


n 
Mihia 


where n is the number of degrees of freedom in the 
discretization. Any gee, has a unique representation of the 
form 

N (3.11) 


bo kta 
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We can now write the discretized equations (3.10) in the form 
of a set of non-linear equations for the coefficients u in 


(3.11) as follows. We define £:R" x Rx R x R R” by 
f (WR Ten) = <A (WR Tan) N > s isl, ee nn (3.12) 
Thus the discretized equations take the form 
ECUR Tan) =O. (3.13) 


Equations (3.13) are treated by numerical methods for 
bifurcation problems and we shall describe these in section 5. 


4. THE TREATMENT OF SYMMETRY 


As we mentioned in the introduction an important feature of 
the particular version of the Taylor problem we are considering 
is that the apparatus is mirror symmetric about the mid-plane 
of the annulus. In this section we describe how this 
symmetry is represented mathematically and discuss its 
implications for the discretization. 


To begin we define the reflection symmetry operator S:H > H 
by 


stu, (r,2z) ,u, (x,2) ,u, (x12) rP(x,z)} = {u,(r,-z), Uy (£772) ru, (£,-2), 


p(x,-z)} (4.1) 


(3 


for smooth functions, and use continuity to extend the 
definition to the whole space. It is easy to see that s+I and 


that 57=1. Since the boundary conditions of the problem are 
symmetric we may take 


st=0. (4.2) 


The change of variables (r,z) > (r, -z) in the integrals in 
(3.1) and (3.2) gives 


a(SU;SV,W/R,Tyn) = a(UsV/SWRT yn) (4.3) 


b) (SU,W/R,T yn) b, (U/SW/R,T yn) . (4.4) 


Similarly we have 
<SU,W> = <U,SW> . (4.5) 
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Now, using (3.3)-(3.5) and (4.2)-(4.5) it follows that 


A(SU,R,T,n) = SA(U,R,I,n) « (4.6) 


It is clearly important that the discretization process 
preserves the symmetry properties of the original equations 
(Brezzi, Rappaz and Raviart, 1981). Since HCH we can use the 


same operator S in the discrete case. It is easy to see that 
S:H 7H, if and only if the finite element mesh is symmetric 


about the line z=0. If this condition is satisfied then we may 


choose uy such that 


Q= 


sů, = (4.7) 


and it then follows from (3.7-3.10), (4.4), (4.5) and (4.7) 
that 


Ap (SU, R/T n) ad SA, (Up R Ten) . (4.8) 
Thus, provided the finite-element mesh is symmetric, the 
discrete problem has the same symmetry property as the original 


problem. 


We define Dcp by 


o 
L 


= {(r,z)|osrsl, ogzg0.5} , (4.9) 


and note that 


+ + 
a (U;V,W,R,T,n) + a (SU;SV,SW,R,I,n), 


(4.10) 


n 


a (U Y,W,R, Tn) 


+ 
b (UWR, T, n) = b) (UW, RT n) + bY (SU,SW, R T,n) , 
EYS (4.11) 
where the superscript + indicates that the integral in (3.1) 
+ 
and (3.2) is over D rather than D. These results are used 
in the next section. 
We now construct a representation of the symmetry operator 


S on E, First we label all the degrees of freedom in the 
problem. Those associated with nodes on the line z=0 we label 
from 1 to 2. We assume that this line coincides with element 
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boundaries, and since the pressure degrees of freedom for the 
element we are using are all associated with the central node 
(see Fig. 1), there are no pressure degrees of freedom on it. 


We then label all the other degrees of freedom in pt from 2+1 


to £4+m. Finally the degrees of freedom in p-p* are labelled 
from 2£+m+l to 2£+2m(=n) in such a way that a velocity (or 
pressure) freedom at a node with coordinates (r,z) is labelled 
i (2+m+lsis2+2m) where i-m is the label of the corresponding 
velocity (or pressure) freedom at the node with coordinates 
(x,-z). It follows from the symmetry of the grid and the 
above labelling scheme that: 


SN, = EDN) ' (4.12a) 
where 
i if lsis2 
j = itm if &+lsishtm , (4.12b) 
i-m if &£+m+l<is2+2m 
and 
eli) =+1. (4.12c) 
4 7 3 
8 6 
1 5 2 


Fig. 1 The nine-node quadrilateral element used for the 
calculations. Velocity values are associated with 
nodes 1-9, the pressure and both its first derivatives 
are associated with node 9 only. 


The sign in (4.12c) depends on the particular freedom in 
question. If it is the r- or $- component of velocity or the 
pressure or its r-derivative the sign is positive. If it is 
the z-component of the velocity or the z-derivative of pressure 
the sign is negative. This follows directly from the 
definition of S (4.1). Let Uh € Ha’ then using (4.12) we have 
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L £+m 


n 
SU. = £ u.,SN, = £ u,e(i) N, a E u,,e(itm) N., 
shy Dla aie aed SAU ES i 
2+2m oe) 
+ = u,__e(i-m)N,. 
istm +? i 
Let 
E (1) (2) o 
E= a (4.14) 
o (n) 
and 
|z 01.0 
g 
P= o o or (4.15) 
m 
|° zr, 0 


where I is the lxt identity matrix etc. Equation (4.13) may 
now be written 


h jay 2 ~4 
where 
y=Su, 
and 
S = PEP . (4.16) 


Thus s:r” is the required representation of S on Roy and the 
function f defined in (3.12) satisfies 


§ £(u,R,T) = £(8u,R,T) , (4.17) 


which follows immediately from (4.8) and (4.16). Clearly $ + 1 


a2 
and $° = PEPPEP = I since P? = E? = 7, 
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5. SYSTEMS FOR COMPUTING SINGULAR POINTS IN TWO PARAMETER 
PROBLEMS 


Equation (3.13) has three parameters in it, however, in an 
experiment it is only possible to vary the Reynolds number, R, 
and the aspect ratio [. The radius ratio, n, remains fixed. 
For the rest of the paper, then, we will omit the explicit 
dependence on n and treat (3.13) as a two parameter problem. 
Thus we want to find all (u,R,T) that satisfy (3.13). This 


equation defines a "two-dimensional" surface in R XRXR. 

The techniques for solving this problem will be described in 
the next two subsections and are based on Jepson and Spence 
(1985) and Cliffe, Jepson and Spence (1985). In subsection 


5.1 we discuss continuation and extended systems for 
singularities when the Z,-symmetry is not broken and in subsection 


5.2 we consider singularities where the Z,-symmetry is broken. 
In subsection 5.3 we point out the numerical implications of 


the symmetry. 


5.l Two parameter problems without symmetry 


First we consider the problem of solving a one parameter 
problem: 


g(x,A) = Q XER, eR . 


The basic continuation method (Keller 1977) introduces an arc 
length parameter s to parametrize the solution. This involves 
solving the system of equations 


g(x) =O, 
N(x,A,s) =O, 
where 
as T EN 
N(x,A,s) = Jg (So) (x(s)-x(s.)) + Fs (59) (A(s) -A(s,)) -(s-s,) 
(5.1.1) 


by Euler-Newton continuation applied to the parameter s 

(Keller 1977). The reason for introducing s is that when there 
is a limit point on the solution branch ordinary continuation 
methods fail whereas the Keller method will succeed in 
following the branch around the limit point. 


System (5.1.1) is used to calculate slices through the "two- 
dimensional" surface by fixing I (or R) at I = TS (or R= Ro) , 


with x=u,A = R (or T) and g(.,-) = flar To) (ox ECR). 
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A limit point may be characterized as an isolated solution of 
the following extended system (Moore and Spence 1980) 


£(u,R,T) 
EYD £ (wR Tp] 0 (5.1.2) 


ot g-1 


with y = (u,$,R), per” and Ler”. The set of points (u,R,T) 
such that (5.1.2) is satisfied by (y,T) defines a curve in 


R° XRXR which Jepson and Spence (1985) call a fold curve. 
Fig. 2 illustrates the reason for this term: the surface is 
folded back on itself and the fold curve marks the edge of the 
fold. To calculate this curve we apply the Keller arc length 
continuation method (5.1.1) to (5.1.2) with x = Y, ` =T and 
Ilr) = F(.ye)- a S 

u 


R Fold curve 


2) 


Solution surface 


Fig. 2 Part of the solution surface of a two parameter 
problem (R and I) illustrating a fold in the surface 
and the fold curve, 


The fold curve may itself have limit points, these 
correspond to points at which the solution to (5.1.2) is not 
isolated, so that in the neighbourhood of the point there will 
be two or more limit points for each value of Į. In a two 
parameter problem these limit points on the fold curve can occur 
in two ways that are stable to small perturbations. We note 
that only stable singularities can þe computed as there are 
always small perturbations present due to rounding errors. At 
these special points an extra equation must be satisfied in 
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addition to (5.1.2). 


P Let uto Polo satisfy (5.1.2) and denote £(u R/T) by 
£ etc. Let t be the left eigenvector of E If in addition 


to (5.1.2) the solution satisfies 


o 
(38 fa TO , (5.1.3) 


where the subscript R denotes differentiation with respect to 
R, the point Ma Ro'To is either a transcritical bifurcation 


point (which we shall denote by T) or else an isola formation 
point, depending on the sign of a scalar quantity involving 
higher derivatives of f (Jepson and Spence, 1985). 
Bifurcation diagrams near T are given in Fig. 3. 


= 


<m ER Porm 


Fig. 3 State diagrams near a transcritical bifurcation point, 
T. The critical value of the second parameter I is 
Ta? the order of the diagrams could be reversed. 

If in addition to (5.1.2) the solution satisfies 


Yo on s5 gi Tiy (5.1.4) 


the point (a, Ro To? is a non-degenerate hysteresis point 


which we shall denote by H (Jepson and Spence, 1985; 
Golubitsky and Schaeffer, 1979). The bifurcation diagrams in 
this case are shown in Fig. 4. The projection of the fold 
curve onto the R-I plane has a characteristic cusp shape, with 
the tip of the cusp at the hysteresis point. 
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ok 


KR Pet, A 


Fig. 4 State diagrams near a non-degenerate hysteresis point, 
H. The critical value of the second parameter I is 
T the order of the diagrams could be reversed. 


5.2 Two parameter problems with Z -symmetry 


We now consider the implications of the Z,-symmetry. This 


subject is covered in detail by Cliffe, Jepson and Spence 
(1985) and so the treatment will be brief. The mapping S 


induces a natural decomposition of RS into 
R =x ©Xx (5.2.1) 
xX, OX +2. 
where 
x, = {xe R"|Sx=x} x= {xer"|Sx = -x} , (5.2.2) 
consist of the symmetric and antisymmetric elements of R 
respectively. A symmetry-breaking bifurcation point may be 


characterized as an isolated solution of the following 
extended system (Werner and Spence 1984) 


£(u,R,T) Y = (u,$,R)ey 
F(y,T) = £ CURT) $+ = 0 Y= X_XX_xR (5.2.3) 
ae F:Y¥xR-Y 


It is important to note both the similarity with (5.1.2) and 
the essential differences, namely that u must belong to x, and 


$ must belong to z Thus the basic solution is symmetric but 


the eigenvector is antisymmetric so that the bifurcating 
branch is asymmetric. In direct analogy with a path of limit 
points, the set of points (u,R,T) such that (5.2.3) is 
satisfied by (y,I) defines a curve which lies on the symmetric 
solution surface. This curve is the intersection of the 
surface of asymmetric solutions with the symmetric ones. 
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A path of symmetry-breaking bifurcation points may have a 


turning point and this happens if 


o 
toffru fo * fm fo R TO! (552) 
where 
o OPR 
fa Ye th, TO ; Rey 2 . (5.2.5) 


Cliffe and Spence (1984) call this type of singularity a 
point of coalescence because two symmetry-breaking bifurcation 


points coalesce at such a point. 


The bifurcation diagrams 


depend on the sign of a scalar quantity involving higher 


derivatives of f (we shall not give the details here). 


We 


denote these two cases by ct and C and the relevant diagrams 


are shown in Fig. 5. 
u 


a 


rep ER >m 
Fig. 5a 
t 
ey Nk 
R 
MR R rR 
Fig. 5b 


Fig. 5 State diagrams 


near the two types coalescence point, 


(a) ct and (b) C. The critical value of the second 


parameter l is To 


be reversed. 


; the order of the diagrams could 


Another type of singularity that can occur on a path of 
symmetry-breaking bifurcation points is a quartic symmetry- 
breaking bifurcation point which we denote by Q (Cliffe and 


Spence, 1984). 


Here the bifurcating asymmetric branch has 


quartic dependence on the excess Reynolds number near the 


singularity rather than quadratic dependence. 


In addition to 
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(5.2.3) the quartic point satisfies 


o o 
= «2.6 
Yo Cen AI aan £6 Zo) igo s ) 
where 
o o 
= . .2.7 
fo za + 2, fato TOZ (5.2.7) 


The bifurcation diagrams near such a point are shown in Fig. 6. 
Note that a pair of fold curves of the asymmetric solution 
surface splits off from the symmetric solution at a quartic 
symmetry-breaking bifurcation point. 


Quadratic Quartic 
A contact contact 
MKR ER ri 


Fig. 6 State diagrams near a quartic symmetry~breaking 
bifurcation point, Q. The critical value of the 
second parameter I is Tas the order of the diagrams 


could be reversed. 


The final type of singularity which can occur along a path 
of symmetry-breaking bifurcation points is called a double 
singular S-point, which we shall denote by D (Werner, 1984). 
This is a point at which the path passes over a fold so that 
a limit point and symmetry-breaking bifurcation point coincide. 
The equations satisfied by such a singularity are 


£ 
epee fs © Sa 
$ =O" te 
u ~va $a ex.- (5.2.8) 
45 2, 2 
ea DaT 


5.3 Numerical implications of symmetry 


In this subsection we discuss some of the implications of 
symmetry for the calculation of symmetry breaking bifurcation 
points. In particular we shall show that the number of degrees 


See 


— 


ane 


ee 
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of freedom involved in solving each equation in (5.2.3) is 
about n/2 or roughly half the total number of degrees of 
freedom in the problem. 


First we define (cf. (5.2.2)) 

H = {y € a, lsy miu ga 

= -u,}. (5.3.1) 
Now (5.2.3) may be written in the form: find Y, € HP, $ © HO 


such that 


ah (OO WRT en) + by (UO, sW, R/T yn) 


+ Ch Wy RT en) = 0 for all Wh e€ Ha , (5.3.2a) 

ap (Yh fn WRT en) + a (9/0, ip RTN) 

+ by (Bhp RTN) = O for all Wa E€ Hye (5.3.2b) 
£6 -l =0 (5.3.2c) 


~h 


s 
It follows from UL € He h € Hp and (3.7)-(3.9), (4.7) and 


(4.10), (4.11) that (5.3.2) may be written as 


+ + 
a (Up Up Wp + SW RT) +b (UW, + SW RTN) 


+ 
+ chh + SW, RTN) = 0, for all Wa € Ha’ (5.3.3a) 


at (U3 W -SW 


+ 
nh fn MSW, RT n) + a (@ FU,» W,-SW, R/T yn) 


=H 
+ ba W,-SW, R/T yn) = 0, for all Wa E Hy (5.3.3b) 
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-l= 3 (5.3.30) 
kia l=0 


$ 
Equations (5.3.3) involve integrals over the half region D , 
it follows that (5.3.3a) and (5.3.3b) are equations for Va and 


h (equivalently u and ¢) which involve approximately n/2 


degrees of freedom each. 


It is informative to regard equations (5.3.3) as the 
discretization of a boundary value problem for u (E Wy a 
+ 
and ¢ (= Gr bye em) on the region D . The boundary 


conditions on the line z = O are 


du du 
—= = tat = = 
az az GE ee 
ag 
= ee 
= bo əz Qas 


6. RESULTS 


In this section we briefly describe some calculations 
performed using the numerical techniques discussed in the 
previous sections. The calculations are relevant to the way 
in which the number of Taylor cells present in the apparatus 
changes from four to six as the aspect ratio increases. A 
detailed description of the problem and the results can be 
found in Cliffe (1985). 


Fig. 7 shows the locus of paths of limit points (or fold 
curve) and symmetry-breaking bifurcation points in the R-I 
plane. The radius ratio was fixed at n=0.6 to correspond to 
the experiment of Mullin (1982). The fold curve is indicated 
by the solid line and there is a nondegenerate hysteresis 
point H and transcritical bifurcation point T on this curve. 


The path of symmetry-breaking bifurcation points (dashed 
curve) is rather interesting. At the bottom right hand corner 
of the figure the bifurcation is supercritical, that is the 
asymmetric branches exist for R greater than the critical 
value. This path moves from one side of the fold surface to 
the other at the point D. an enlarged diagram of the region 
near D is shown in the inset in Fig. 7. at the point Q there 
is a quartic symmetry-breaking bifurcation point and at aspect 
ratios greater than that at Q the bifurcation is subcritical. 
The solution branch bends back round as indicated in Fig. 6, 
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giving rise to a path of limit points in the asymmetric 
solutions which are indicated by the chained line in Fig. 7. 

At C there is a turning point in the path of symmetry-breaking 
bifurcation points and the solution diagrams near this point 
look like those in Figs 5b. Finally there is a second turning 
point in the path at ct where the solution diagrams look like 


those in Fig. 5a. 


iS 

g 

5 

v 

a 

uw 

< 
a L | n N 

60 110 160 210 260 310 
Reynolds number, 
Fig. 7 Critical loci for four- and six-cell modes with radius 


ratio 0.6: numerically calculated fold curve, 
---- numerically calculated path of symmetry-breaking 
bifurcation points, -.-.- numerically calculated fold 
curve for asymmetric solution surface. 
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HYPERBOLIC PRECONDITIONING TO SOLVE THE INCOMPRESSIBLE 
EULER EQUATIONS FOR VORTEX FLOW 


A. Rizzi* 
(FFA The Aeronautical Research Institute of Sweden, 
S-161 11, Bromma, Sweden) 


l. INTRODUCTION 


The ability to compute detailed information about 
incompressible flowfields by the numerical solution of a set of 
governing equations offers great utility to industrial 
designers of turbomachinery, internal ducts, and pumps as well 
as transport vehicles, be they ocean-going ships, road vehicles, 
or airplanes. Most of these applications involve flows having 
very high Reynolds numbers, and the potential equation is what 
first comes to mind as the suitable flow model. Many of the 
flows, however, possess substantial regions with rotation, one 
very common feature being the shedding of vorticity from sharp 
edges. Working with the potential model, one faces the 
cumbersome task of setting up vortex-sheet discontinuities in 
the field and then adjusting or fitting them to the surrounding 
flow. This requires prior knowledge of where the sheets begin 
and becomes very complicated for all but the simplest 
situations. The alternative, especially when the vortex 
topology is complex or unknown, is to adopt the Euler equations 
as the flow model because they allow rotational flow everywhere 
and vortical regions are "captured" implicitly as an integral 
part of the solution. 


The recent discovery that a vortex sheet can be generated 
and captured in the solution of a compressible flowfield with 
separation from the leading edge of a delta wing at high angle 
of attack has stirred up a lot of activity in the development 
of numerical methods to solve the compressible Euler equations. 
Unfortunately these methods do not work very well for an 
incompressible problem simply by setting the freestream Mach 
number to a very small value, zero in the limit, because with 


*also Adjunct Professor, Royal Institute of Technology, Stockholm 


200 RIZZI 


decreasing Mach number the speed of the sound waves becomes 
much larger than the speed of convection. This increasing 
disparity in wave speeds causes the governing system of 
hyperbolic equations to be poorly conditioned, and the 
stability of the computation is greatly impaired. The 
conventional approach to solve the equations is to recast them 
into the vorticity-streamfunction formulation, but while 
effective in two-dimensional problems, it turns out to be 
impractical in three. If, however, the interest is only in 
steady three-dimensional flow, artificial compressibility is 
one way around the difficulty because this approach removes 
the sound waves from the system by prescribing a pseudotime 
evolution for the pressure which is hyperbolic and which 
converges to the true steady-state value. In a theoretical 
study Viviand (1983) recently has generalized this concept of 
a pseudo-unsteady approach, and he has defined a broad class 
of such systems that includes the one of artificial 
compressibility. He did not, however, demonstrate any of them 
in an actual computation. The purpose here is to investigate 
in detail two particular systems from the class of Viviand and 
compare them to artificial compressibility (Rizzi and Eriksson, 
1985). I show how the pseudo-unsteady approach leads to a 
hyperbolic system, carry out a numerical study of its 
condition, set forth the CFL stability limit for the time 
integration, and examine the types of discontinuities that it 
admits. The method can be seen as a hyperbolic preconditioning 
technique because it modifies the wave character of the system 
to be optimal for numerical solution by a hyperbolic | 
integration scheme. 


2. FLOW SEPARATION WITH EULER EQUATIONS 


The central issues then become physical rather than 
numerical, i.e. to understand how vorticity is created in the 
flow, to calibrate the diffusion of the vorticity, and to 
gauge to what extent errors in total pressure degrade the 
overall accuracy of the solution. The first Perhaps is most 
controversial. If an oncoming stream of fluid is initially 
irrotational, how, in the absence of irreversible processes, 
can vorticity appear in the flow? Consider now a solid body 
that presents an obstacle to the flow. The fluia wets the 
surface and at some point, or along some line, it. detaches 
from the body and continues downstream. If the flow is 
potential, its velocity nowhere can be in shear. But if it 
obeys the Euler equations, the flow can separate along a line 
on the surface, across which the velocity may be in shear. The 
discontinuity in velocity direction then extends in a surface 
out into the flow which is called a vortex sheet. The total 
pressure On each side of the sheet is the same as that upstream 
of the body. No losses have been inflicted and vorticity 
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exists only inside the vortex sheet, an infinitesimally thin 
surface of discontinuity, which is also a stream surface. The 
circulation along every closed fluid circuit therefore remains 
the same. The sheet, however, is linearly unstable and begins 
to deform according to the Kelvin-Helmholtz instability. But 
in some cases, like a stream sweeping past a sharp edge at 
some angle (Fig. 1) the sheet can, under the influence of its 
own vorticity, coil up to form a steady but infinite spiral. 


Loo, 


Fig. 1 Steady tip vortex formed by flow sweeping obliquely 
past a sharp edge. 


But allowed just the slightest amount of diffusion, vorticity 
begins to seep out of the sheet and forms a vortex core after 
a finite number of turns in the spiral. Do we observe such 
phenomena in a numerical simulation? To try to answer this 
let us look at a numerical solution to this problem in its 
simplest possible setting, incompressible flow past a flat 
delta wing in which a stable vortex sheet is shed from the 
leading edge and then coils up into a steady vortex over the 
wing (Fig. 2). 


Fig. 2 leading-edge vortices on lee side of a delta wing. 


I analyze the qualitative as well as quantitative aspects of 
this flow, computed by hyperbolic preconditioning, in 
comparison to the results of a 3D panel method that fits the 
vortex sheet to the surrounding potential flowfield. The fine- 
mesh solution with over 639 000 grid points confirms, but in 
greater detail, the presence in a medium-mesh calculation of a 
torsional wave superposed upon the vortex core as it approaches 
the trailing edge of the wing. The shearing of the flow 
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by the trailing edge causes this wave-like disturbance. 
3. PSEUDO-UNSTEADY METHODS FOR INCOMPRESSIBLE FLOW 


This section describes the pseudo-unsteady methods I use to 
solve the Euler equations for incompressible flow. It starts 
with a discussion of the artificial compressibility method, 
shows its generalization to the optimal system of Viviand, 
followed by a description of the computational method. 


3.1 Artificial Compressibility Method 


Practically all methods to solve the compressible Euler 
equations are based on the hyperbolic character of these 
equations. Unfortunately, if one tries to apply these methods 
to an incompressible flow as the limit of smaller and smaller 
Mach number, one finds that they do not work very 
satisfactorily. This is because the sound waves allowed in the 
mathematical model have very high speed and they dominate the 
system. If, however, only steady incompressible flow is of 
interest, the way around this difficulty is Chorin's artificial 
compressibility concept (see Chorin, 1967; Peyret and Taylor, 
1983; Rizzi and Eriksson, 1985). In this concept an 
artificial time-dependent term is added to the continuity 
equation so that the governing equations become 


an ap RAP əy l 
= + = ART E iS = = 
A at c div v O3; ot + (v grad) v + Po grad p= 0 


(3.1) 


where c is an arbitrary real parameter and (A is the constant 


density of the flow. This set of equations with modified 
dependent variables and scaled spatial derivatives has no 
physical meaning until the steady state is reached. However, 
when the steady state is reached the system becomes identical 
to the true steady equations. The still arbitrary parameter 


ee can be selected to accelerate the time decay to steady state. 
The major advantage of the modified system over the original one 
is that the high-speed sound waves have been eliminated, 
rendering the modified system much better conditioned for 
numerical solution. The advantage here is very similar to the 
removal of gravity waves by the geostrophic approximation in 
the equations of meterology. 


3.2 Well Posedness 


In three~dimensional Cartesian coordinates the conservation 
equations for the artificial compressibility approach to 
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steady incompressible flow are 


SET PME + He By mo (3.2) 
where 
P/o, ‘Cx OORO u v w 
q=| u =lo 100 f=|u +P/e, g=| uv h=| uw . 
v Q: OF ino! uv v'+p/0, vw 
w OP o ABE > ala b uw vw w'4p/o, 


The equivalent quasilinear form of system (3.2) is 


Be, cay W RARO 
e oe Tt Boy ot Cpe 2710 (S3) 
with 
oe Nono o o Pba 
asmin fi 2u 0 o pew.22 = jo v u o 
a aq 
o u O iv OF 2v 0 
o O u Oo Ow v 
2 
f OTRE o IN = AE 
cm. B o wo . 
ðq 
O Ow 
EO O72 


It is called hyperbolic at the point (x,y,z,t,q) if there 
exists a nonsingular matrix T(«,ß,€) that diagonalizes the 
linear combination D= QA+ßBB+€C 


1 Q) (2) EN 4, 


T DT= diag. {A 


where the eigenvalues A of D are real and the norms of T and 


rt are uniformly bounded for arbitrary real &,ß and €. The 
eigenvalues of D are found to be with the definitions 
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(2) (3) 


i =U,A ` = Uta, 


U= autfvtew and moe U RET a a THe), aa À 
Oa U-a and are always real. The range between the smallest 
and largest eigenvalues can be adjusted according to the value 
of the parameter c. Using matrix D's complete set of linearly 
independent right eigenvectors as the columns of matrix T we 
find 


Ovit G a -02a 

T= |-£ -B a(U+a) tac? u(U-a) +00? . (3.4) 
(ee v(U+ta) +807 v(U-a) +807 
a ° w(Uta) tear w(U-a) Teor 


Its inverse is formed from the left eigenvectors of D 


eU- (07467467) w wU+e e? B (wU+e c2) (au+Bv) U+ (a +82) c 


2 2 2 
aa a aa aa 
BU- (07487462) v ‘ vo+po7 (autew) U+ (a tetet > e (vu+Bc") 
aa ae a a a a 
ple _ _U-a a B € 
2a2c* 2a? 2a? 2a? 
_ Uta a E 
> paa os- oix 
2a°c 2a? 2a? 2a 


(3.5) 


Working with the maximum norm one can then go on to complete 
the demonstration of hyperbolicity by making reasonable 
estimates to show that these last two matrices are uniformly 
bounded if |u|, |v|, ana |w| are bounded. 


But how can we choose a numerical value for the parameter c 
in order to obtain a mix of wave speeds beneficial for numerical 
solution? To do so we need to look at a measure more 
quantitative than just the boundedness of the transformation 
matrices. In fact what we want to know is how the bound varies 
with c. This can be determined numerically for given values 


of Vv and c by computing the eigenvalues o of TT since the 
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1/2 S/2, 


=], 
L, norms are Ipl=(o +) ana Ir TEOT) . It is less 


cumbersome but sufficient for insight to consider only the 
two-dimensional problem, i.e. w=£= O. For any specified values 
of V and c a good measure of the condition of the system is the 


number K= Imi Itt 1/2 


= (Oe cia? - One can surmise, and we 
have verified it in an actual computation, that this condition 


number K depends only on the ratio ey. Computed 
numerically and plotted in Fig. 3 as the radial coordinate of 
the polar diagram (K,8) where the wave angle @ defines a= sin 0 
and = cos 0 for Os@s2Il, the condition number K is displayed 


for three different values of 2/vev. When the ratio rec*/v-V 

is greater than unity, the pressure waves dominate over the ~ 

convection waves and the system is less directionally dependent 
and better conditioned. 


Fig. 3 Polar diagram of condition number K of the hyperbolic 
system (3.2) as a function of the plane wave angle 6. 


3.3 Discontinuities 


In steady flow the true incompressible equations admit of 
course only the tangential discontinuity with jump conditions 
[pl= 0 and'y,.n = y,.n = 0. The artificial-compressibility 


however, approaches steady flow only asymptotically in time so 
we must investigate what transient discontinuities, if any, are 
allowed in this pseudo-system of equations. This aspect of the 
method has been overlooked, it seems, by Chorin (1967) and 
Peyret and Taylor (1983). We follow the standard analysis of 
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discontinuities for conservation laws (see Rizzi and Eriksson, 
1985) to obtain the jump relations across the discontinuity 
surface 


slp] 
sl val 


sf val =[ VY] 


2 
c [v 


[v7 +p] avalv ] + [pl (3.6) 


velvet valve] 


where s is the speed of the discontinuity in the direction of 
its normal n. Parallel to the surface is the tangential 
direction t; the asterisk indicates the average of the value on 
each side of the discontinuity e. g. vte (Va apid )/2, and the 

l 2 
square brackets their difference tyle Vv SNo - System (3.6) 
Zhe ot 
is linear homogeneous for given s and average velocities, viz. 


-s c o [p] 
1 2va-s o Iv] =0 (3.7) 
o ve Vi-s Evd 


and a nontrivial solution exists if 


(s-vž) (s? - 2sV* = ê) =0. 


The discontinuity therefore may move Aaa any of three 


ee speeds s = YI s vis (ve? +c yee or s, = 


SA 3 
vs (va? +c 23/2; The jump aR OEA for the first speed then 


are ETAt of (3.6) which work out to be 


[p] o 
[vl =K o with s= va (3.8) 
[ v,] 1 


and K is an arbitrary constant. We immediately recognize this 
solution as the unsteady tangential discontinuity that 
corresponds directly to the steady one. The second and third 
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eigenvectors associated with the speeds ETES and s=53 


é 


respectively 
[pl c? - 
[v] =K s (3.9) 
[v] sve (v* 2407) 1/2 


are more unexpected since they allow jumps in both pressure and 
velocity across a discontinuity. With S=S, the shock travels 


downstream, and the pressure and normal velocity of a fluid 
particle passing through it rises or falls together depending 
on the sign ofk. With 8=83 the shock moves upstream and the 


jumps in pressure and normal velocity take opposite signs. 
These last two (shock) discontinuities are of course 
nonphysical, but it appears that they do not alter the accuracy 
of the stationary solution because they are not allowed to 
remain part of any steady flowfield (Rizzi and Eriksson, 1985) . 


4. OPTIMAL HYPERBOLIC PRECONDITIONING 


Is artificial compressibility the only hyperbolic approach 
for incompressible flow? In an exhaustive study of this 
question, Viviand (1983) examines arbitrary time 


transformations represented by the matrix ut that leads to the 


Ppseudo-unsteady systems 


-loq 3 a Dirge 
M YS + Ont + y + Er o (4.1) 


where the dependent variables q are velocity and either static 
or total pressure or some function of all of these. Two 


essential constraints are set on the elements of ut, 1) 
system (4.1) must be hyperbolic, and 2) it must possess the 
same number of positive and negative eigenvalues as the real 
equations do in order to ensure that the physically relevant 
information is brought in by the boundary conditions. If this 
last constraint is not met, the possibility arises that the 
solution may satisfy the steady equations with incorrect 
boundary conditions. Viviand then finds that there are still a 
number of elements of M free to specify, and he goes on to 
define a class of systems that are optimal in the sense that 
their eigenvalues are all as nearly equal in magnitude as 
possible. Waves then all travel at about the same speed, and 
this may lead to more rapid convergence. In this sense M can 
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be considered a hyperbolic preconditioning matrix. 

Of this class of optimal systems I choose to look in detail 
at two specific systems (Viviand, private communication) which, 
to clarify the concepts, are presented here for two dimensions. 
The results do, however, carry over to three dimensions. They 
are 


a 4 
ze P/Po + 2v? div V - v-div (vv E P/P) = 


(4.2) 
-V + div(yy + I p/p.) - Y div y =0 
and 
a 
3 P p/P, +v div V = 
(4.3) 
E Pug 


They differ from each other in the way the pseudotransients of 
the pressure are driven. In Eq. (4.2) both the divergence of 
mass and momentum directly influence the pressure transient, 
while only mass does so in Eq. (4.3). Neither is strictly 
conservative, Viviand calls them semiconservative, because of 
the variable coefficients on the divergence terms. Adopting 


the total pressure P=p/p +1/2 v? insteady of the static 
pressure p as the dependent variable, Eq. (4.2) simplifies to 


& Pon v? div Vv =0 
(4.4) 
Ê- y + diviyy + T p/p,) -ydivy=0 
and is more convenient for numerical computation. 
In Cartesian coordinates these become 
Sat migita] aes (4.5) 


where f and g are defined in Eq. (3.2) and only q and M may 
differ. The quasilinear form with A=M" (3f) /(əq) and B=M- (3g) / 
(ðq) is 


3 chee dg _ 
gE Lt ARE TE BS O- (4.6) 
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4.1 Analysis of System (4.3) 


Equations (4.3) take the values 


3 = 
P/P s akal o Pig- ° v o ] o o v 

qj} u M= |-2u 1 0] A= {1 0 O]| B= JO v -u (4.7) 
v -2v7 01 O-v u LO 0 


and an analysis of the matrix D=aA+8B determines its 


eigenvalues to be a) caut Byn and a2, ny VEG MATE 


= +YV which are always real. In comparison to those of 
artificial compressibility, the range in their magnitudes is 
smaller, and the largest one is substantially smaller. The 


transformation matrices T and ro work out to 


oO v(U-yvV) = Vv(U+yv) 
T= |-B uy -aV -uy -aV (4.8) 
a vy -BV -vy -BV 


and 


Bu - av u(Bu-av) v(Bu-av) 


ee yv+u a(U_+yv) 8(U_+yV) 
2v 2y 2y . 


-yV+U a (-U+yV) B (-U+yV) 
2V 2y 2y 


Where the flow stagnates, V goes to zero and some elements of 
T become infinite. But the matrix can be bounded by 

2 2 š 
redefining V to be v7=max(e oY ) in the coefficient matrices 


A and B as demonstrated by the plot of the condition number K 
in Fig. 4. 
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Fig. 4 Polar diagram of condition number K of the hyperbolic 
system (4.3) as a function of the Plane wave angle 9. 


4.2 Analysis of System (4.4) 


If we carry out the same analysis of Eq. (4.4) with 


P v o o o v o o v 
q= ju M= |-u A= |1 (0) -v B= 
v Sy O SE Oo O u Sr auae: 
(4.9) 


we find that this system has exactly the same eigenvalues as 
system (4.3): U,+YV but a different eigenvector structure 


v? ywo w 
u 


a -q (4.10) 
v P <$ 


T= 
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and 
ie) -2By 2ay 
-l_ l av-ßu 
~ 2y(av-ßu) V pairin RECN Uy, 
aveu Bv-vy -aV+uy 


Fig. 5 Polar diagram of condition number K of the hyperbolic 
system (4.4) as a function of the plane wave angle 0. 


Figure 5 displays its boundness by the plot of condition 
number K. It is interesting to note that while all three 
systems (4.2), (4.3) and (4.4) possess the same optimal-size 
eigenvalues, they all have different eigenvectors. Even the 
simple change of dependent variable p to total pressure P in 
going from Eq. (4.2) to Eq. (4.4) results in a different 
eigenvector structure. But whether the wave patterns of one 
system offer more benefit to the numerical solution than 
either of the other two awaits discovery by actual computation. 
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4.3 Discontinuities 


Because Eqs. (4.2) and (4.3) are not strictly conservative, 
we cannot formally derive unsteady jump relations as we did for 
the artificial-compressibility method. If one supposes these 
variable coefficients to be locally constant, say, at their 
average values on each side of the discontinuity, then 
relations can be derived analogous to those of Eqs. (3.6) and 
(3.9) for exactly the same types, namely a moving tangential 
discontinuity, and forward and backward travelling shocks. 

How valid this sort of analysis may be is not of great concern, 
however, because only the steady state has any physical 
relevance. And once a stationary solution to any of Eqs. 
(4.2), (4.3), or (4.4) is reached, it does satisfy the steady 
jump relations 


[Iv] =o 
n 
aval] +[p] =0 (4,11) 


valv,] + valv ] = 0 


for a tangential discontinuity, the only physical one allowed. 
That aberrations in the travelling discontinuities, due to the 
semiconservative form, should distort the steady state then 
seems unlikely. This is the meaning of the term 
semiconservative. Viviand (1983) also points out that, although 
a strictly conservative class of pseudo-unsteady systems can be 
derived, it does not possess the optimal property of the 
semiconservative class. 


5. FINITE VOLUME SOLUTION OF PSEUDO-UNSTEADY SYSTEMS 


Let us see how well this theory for pseudo-unsteady systems 
works in actual computations, which is, in fact, the first time 
it is put to a numerical test. I shall compare two solutions 
for incompressible flow past a 70 deg. swept flat delta wing, 
the first is by the usual artificial compressibility system 
(3.2), and the second by the optimal system (4.4). These are 
solved numerically in the general finite-volume form (Rizzi 
and Eriksson, 1984) 


2 J a avol + mff H-en ds = 0 (5.1) 


where Bram TVn, uyon + p/o, nre Win p/o nee, Wy-n + 
p/p nee] is the vector flux of q across the surrounding grid 
of quadrilateral cells with volume VOL and vector areas 
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S_=SIxe +S + = 
Ss, Syt SIY e, + SIZ e , $= SIX e, + SIY e, + SIZ e, and 


$,“SKX e, + SKY ey + SKZ e,. To analyze Eq. (5.1) locally 


after semi-discretization using centred space differences, hold 
the metrics of the cell ijk constant to obtain 


da ~ ~ ~ 
at (q VOL) ijk + (A ôt B 85 +C 8) dijk? (5:2) 
E EE . 98S.) 
where A= M*—~——— , B= M*—~——_ , and C= Mey . 
aq aq aq 


5.1 CFL Condition 


In order to solve the hyperbolic system (5.1), assuming 
that under appropriate boundary conditions it does converge to 
a steady state, I straight-forwardly apply the time-marching 
finite-volume procedure developed originally for the 
compressibly Euler equations which uses an explicit three- 
stage Runge-Kutta-type time integration scheme (see Rizzi and 
Eriksson, 1984). In the absence of boundaries the usual 
linearized Fourier analysis of Eq. (5.2) specifies the limit 
on the time step for which the integration locally is stable, 
i.e. the CFL condition, is At<cFL/|\|max where the constant 
CFL depends on the particular multistage method that is used. 
A conservative estimate for the maximum eigenvalue |A|max of 
the 3D spatial difference operator for Eqs. (3.2) leads to the 
local-time-step condition 


VOL 


ABR ORLY rere earl | (5.3) 
ijk Ds (ores 2/2 ijk 
where 
j= |ustx + vsiy + wsIz| 
+ |usax + vsgy + wsuz| 
+ |usKx + vSKY + wsKz| 
s= (| stx| + |sax| + |sKx|)? 
+ (| sty] + [soy] + |sxy|)? 
+ (| szz| + |saz| + |sxz|)? Š 
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For the optimal system (4.4) a similar estimate gives’ 


VOL 


v:-s 


(S) 
At, jH ] 


ijk 
where Ea ee NTE oe Thus we see that optimal 
Preconditioning offers a time step that is 2 to 3 times larger 
than the standard pseudo-unsteady system. The computed results 
presented here have all been carried out using these step sizes 
in the local-time-step integration scheme. 


5.2 Boundary conditions 


Boundary conditions of course specify the particular 
problem and two different types are of concern here: flow 
conditions on a solid wall and at the farfield boundary of the 
mesh. Since the mesh is aligned to the wall, for the first 
type I set the velocity flux through the wall to zero and 
determine the pressure p on it from the normal component of 
the incompressible momentum equation V° (Vegrad) n = n-grad P/Po 
which is exactly analogous to the conditions used for 
compressible flow (Rizzi, 1978). When it is differenced to 
formally first-order accuracy, the pressure on the surface is 
deduced from the interior values. 


The farfield conditions are specified as a form of Engquist's 
hierarchical series of absorbing boundary conditions (see 
Engquist and Majda, 1978). The First Approximation of this 
theory reads that setting conditions on the characteristic 
variables, instead of the flow variables q, of the component of 
Eq. (5.2) normal to the boundary is maximally dissipative and 
therefore absorbs more energy. Selecting a,8 and e as the 
components of the unit vector normal to the boundary means that 
the eigenvalues à of D are the slopes in time of the 
characteristic surfaces in the direction normal to the boundary. 


The transformation matrices rt and T diagonalize this one- 
dimensional equation so that 


Fethste0 where 9=1""q and A=t™4p t=diag{, P ,,(2) (3), (4), 


The combination of boundary conditions determined from outside 
the domain and auxiliary conditions set from inside follows 
in the now-standard way according to whether the associated 
characteristic directions:enter or leave the domain. When uU<O 
set the three (corresponding to negative eigenvalues) ingoing 


o gta aa 9/4) 


characteristic variables 9 to their 
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freestream values, linearily extrapolate the third 9) from 


the computational field, and then solve for the original 


unknowns q= Td. At outflow it is 4) 


of undisturbed flow, and 9, pa 


that is given the values 


and Wi are extrapolated. 
5.3 Artificial Viscosity Model 


The rationale for introđucing an artificial viscosity 
operator into the numerical proceđure has been put forward 
before (Eriksson and Rizzi, 1984). To counteract nonlinear 
aliasing associated with centred differences, to control the 
cascading of energy from large scale to small scale, and 
perhaps even to ensure the existence of a steady state are 
among the reasons that bespeak such a model. The dissipative 
operator Dq that I add to the convective differencing of the 
incompressible Euler equations contain a linear fourth- 
difference term for each of the three spatial directions 
(see Rizzi and Eriksson, 1984). They are implemented with 
boundary conditions so that the complete operator has the 


negative semi-definite property (a'na) <O which has been found 
important for minimizing the occurrence of spurious errors at 
solid boundaries (see Eriksson, 1984). 


6. COMPARISON OF COMPUTED RESULTS 


The mesh around the delta wing is an O-O type constructed 
by Eriksson's (1982) interpolation method that places a polar 
singular line at the apex and a parabolic singular line at the 
tip of the trailing edge. The computation begins with the 
freestream flow as initial conditions on a coarse mesh that has 
40 cells around the half span, 20 each on the upper and lower 
chord, and 12 outwards, (see Fig. 6). This particular grid 
topology focusses cells along the leading and trailing edges, 
as well as the apex where the flow changes most rapidly. It 
requires, however, a slight rounding of the wing tip. 

Figure 7 presents an overall comparison of the time evolutions 
of the two solutions to system (3.2) and (4.4), respectively, 
given by the root-mean-square and maximum residuals of the 
time differences of p, and by the lift and drag coefficients. 
The high-frequency changes in the residuals of the optimal 
system (4.4) suggest more rapid wave propagation than for the 
artificial-compressibility solution (3.2). However, the 
comparison of pressure coefficients computed on the wing 
surface (Fig. 7) reveals that the vortex is not as fully 
developed in the optimal-system solution because its suction 
peaks are not as large. The low pressure near the leading edge 
indicates that the vortex sheet may not be developing properly. 
A preliminary explanation for this may be that the semi- 
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Polar singular 
line 


(a) 


Fig. 6a Main features of an 0-0 grid generated around a delta- 
shaped small aspect ratio wing. The polar singular 
line produces a dense and nearly conical distribution 
of points at the apex which is needed to resolve the 
rapidly varying flow there. 


b) 
Fig. 6b Three-dimensional view of the delta wing mesh. 
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Chord section 


Detail at 
trailing edge 


Detail at leading edge 
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(c) 


Fig. 6c Partial chordwise and spanwise views of the actual 
medium (80x24x40) mesh used for the computation of the 


70 deg. swept flat plate. 
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conservative system (4.4) does not capture vortex sheets as 
cleanly as the strictly conservative one does. 


1 
10" 1.0. ee 
T 10 RES ms s 
X 4 S g 
£ 10” RES, | C 0.3 6 
max 
10° 0.5 Co 
0 
@ o 400 800 0 400 800 
1.5 
3.0 
1 
10^ RES ims ia 
qin a 
= 1 2 
3 10" RES max 1.5 § 
107 0.5 
0 0 
(b) 0 200 400 200 400 
Cycles Cycles 
++ System (3.2) -—-—- System (4.4) 


Fig. 7 Comparison of time evolution of the coarse-mesh 
solutions to (a) system (4.4) and (b) system (3.2), and 


(c) comparison of the two computed surface pressure 
distribution. 
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Very little is known right now about the capturing of 
vortex sheets in numerical grid solutions, so it is difficult 
to come to any definite conclusion. Hoeijmakers and Rizzi 
(1984) carried out one of the first comparisons between a 
vortex sheet captured in a solution to the Euler equations and 
one fitted to a potential flow. That comparison, using a 
solution to Eq. (3.2), demonstrates that a stable vortex sheet 
can be captured which does agree well with the fitted potential 
one. The Euler solution, however, indicates a wave-like 
structure superposed upon the vortex core as it nears the 
trailing edge. A further question then arises, does the 
stability of the captured sheet, its close agreement in size 
and position with the fitted one, and the observed wave 
structure all change as the mesh size is refined? The fine- 
mesh solution presented in the next section attempts to 
answer it. 


7. VORTEX SHEET CAPTURED IN FINE MESH SOLUTION 


This section presents the conservative solution (3.2) of 
incompressible flow around the delta wing of zero thickness 
and unit length at 20 deg. angle of attack obtained upon two 
mesh densities. The first, the medium density, contains 
80x24x40 cells, and is the same one used by Hoeijmakers and 
Rizzi (1984). The second, the fine density has 160x48x80 
cells. Steady flow separates from the leading edge in a vortex 
sheet which then, under the influence of its own vorticity, 
rolls up to form a vortex over the wing. Due to the lack of 
experimentally measured turbulent flow data for this case, a 
comparison with measurements is not carried out. Instead the 
results are compared with those from Hoeijmakers's potential 
boundary-integral (panel) method which inserts a vortex sheet, 
adjusts it to the surrounding irritational flowfield, and 
allows it to rollup under its own influence for several turns, 
and then models the remaining core by an isolated line vortex 
(see Hoeijmakers et al, 1983, and Hoeijmakers and Rizzi, 
1984). The position and strength of the vortex sheet and 
isolated vortex are determined as part of the solution, 
sometimes termed "fitting" the rotational flow features. They 
are true discontinuities, infinitesimally thin, and for this 
reason a very good choice for comparison because the sheet 
and vortex in my solutions are not infinitesimally thin but 
smeared or "captured" over a number of computational mesh cells. 
The comparison therefore offers a good control on the position 
of the computed vortex and the diffusion of the sheet as well 
as how these change with mesh size. Furthermore such panel- 
method results have been found to agree reasonably well with 
measurements made in turbulent flow (see e.g. Hoeijmakers 


et al., 1983). 


220 RIZZI 


The thickness of the rotational flow features captured in 
the solution to the Euler equations varies directly with the 
size of the mesh cells. The simplest way, therefore, to 
minimize the diffusion of vorticity is to use as dense a mesh 
as possible. The fine mesh here consists of over 600 000 
cells. The computations were carried out on the CYBER 205 
vector computer in 32-bit precision at the rate of 6 microsec. 
per cell per iteration which translates to over 125 mflops. 
The medium-mesh solution was advanced an additional 2000 steps 
beyond the solution discussed by Rizzi and Eriksson (1985) with 
no discernible change found with time. That medium-mesh 
solution, therefore, is steady. It is interpolated to the 
fine mesh and then iterated for 2500 steps. 


Global features of the flow in the medium and the fine 
mesh are compared in Fig. 8 by isograms, drawn in plane 
projection, of the computed solution in three nonplanar mesh 
surfaces x/c= 0.3, 0.6 and 0.9 over the wing, one surface in 
the wake at x/c= 1.15 and one cutting axially through the core 
of the vortex. The isograms of ot vorticity magnitude Jal ' 


and total pressure coefficient (PEP) / (P. -P,,) in Figs. 8(a-c) 
o 


agree for the two solutions and reveal qualitatively the 
leading-edge vortex over the wing, as well as the trailing- 
edge vortex that develops from the trailing-edge sheet, 
interacts, and counter rotates with the leading-edge vortex, 
and prođuces the double-vortex pattern in the wake. This 
wake phenomenon has been observed in the wind tunnel (Hummel, 
1979). The comparison shows that overall the broad features 
of the flow are represented in both grids. The isograms 
viewed axially through the core indicate the approximately 
conical nature of the flow starting at the apex. In both 
solutions at about the 80% chord position, however, the 
leading-edge vortex lifts up slightly and an abrupt change 
takes place which might be interpreted as an additional, and 
unexpected, vortex phenomenon. Although the cause of this 
feature may still be numerical, it is known that the shear 
layer separating from the leading edge does show small-scale 
wave-like structure near the trailing edge (Hoeijmakers and 
Vaatstra, 1983), which could perhaps evolve into a second 
vortex that wraps up and intertwines with the primary vortex 


effect of the development of the trailing edge vortex. In any 
evert more of the details are brought out in the fine mesh. 
The contours of total pressure (Fig. 8c) display qualitative 
agreement with those observed in wind tunnel measurements. 

The computed losses in the core, however, are high, nearly 80% 
of the total pressure, and can be attributed to the numerical 
effect of capturing the vortex sheet. Theoretically the loss 
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Fig. 8a Isobars of pressure coefficient co 
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(b) 
Fig. 8b Vorticity magnitude (not same increment for medium and 
fine contours). 
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Total pressure coefficient — 
Pi- Pao 
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Increment =0.2 


Wake x/c=1.15 


(0) 


Fig. 8c Contours of total pressure coefficient. Increment 
= 0.4, 


Comparison of contour maps of the medium-mesh (80x24x40) and 
fine-mesh (160x48x80) solutions to the Euler equations (3.2) 
for flow past a 70 deg. swept flap plate delta wing. They are 
drawn in four non-planar mesh surfaces at the x/c = 0.3, 0.6, 
0.9 and 1.15 stations and in one mesh surface which passes 
approximately through the axial core of the vortex. M5. =.0, 


a = 20 deg. 
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should be zero (total pressure coefficient =1) on each side of 
the sheet, even though the velocity is in shear. The 
numerical solution has to support this shear with a 
continuous profile over several mesh cells through the sheet, 
and any sort of reasonable profile (say a linear one) 
connecting the velocity vector on one side with the one on the 
other side immediately implies a total pressure loss for the 
profile even if the velocities at both sides are correct. And 
the level of loss quickly mounts as the number of cells 
available tò support the profile diminishes. As the vortex 
sheet wraps up tighter and tighter there will be only one or 
two cells located between the coils, and ultimately it will 
disappear off the mesh completely. At that point it is not 
unreasonable to expect the amount of loss of total pressure 
that we see in Fig. 8c. The fine mesh supports more coils, but 
the sheet must eventually disappear in the same way off this 
mesh too, producing about the same loss at the centre of a now 
smaller diameter core. It is the size therefore, of the 
contour rings, but not its level, that varies with mesh 
spacing. 


Let us now inspect the solution more quantitatively by 
comparison with the results of the potential method. Figure 9 
presents the shape of the fitted vortex sheet (dashed lines) 
from the potential method superimposed upon the vorticity 
magnitude contours of the Euler-equation solution in three 
cross-flow planes. Bear in mind that the dashed lines are in 
the planes x/c= constant while the full lines are projections 
onto these planes of the vorticity contours in the non-planar 
mesh surfaces that intersect the wing at the corresponding 
value of x/c. We see that the vorticity captured in the field 
is diffused over 5 or 6 cells in both the medium and fine 
mesh solutions*, and that, in general, the vorticial flow 
region occupies a larger volume than that enclosed by the 
vortex sheet fitted to the potential solution.- But the 
positions of the vortex cores in the comparisons and even 
the curvature of the sheets agree remarkably well, except at 
the first station x/c= 0.3 where the vorticity contours are 
somewhat larger and more inboard of the fitted vortex sheet. 
The vorticity in the fitted sheet is largest near the leading 
edge where the curvature of the sheet is singular, and the 
Euler-equation solutions indicate the same trend. The sheet 
appears to depart tangentially from the lower surface of the 
leading edge. This comparison with mesh refinement confirms 
that a stable vortex sheet separating from a swept leading 
edge can be captured in the vorticity field of the Euler- 


i LO ae eee 
* the increment between isovorticity contours is not the same 
for the medium and fine mesh solutions. 
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equation solution with a reasonable degree of realism. The 
curious distortion of the contours in the x/c= 0.9 station 

and the associated islands of vorticity are another indication 
that a torsional wave may be standing on the core and giving 
rise to subsidiary vortices. 


Medium Fine 


0.5 
ysi) 


— Euler ---- Potential 


Fig. 9 Comparison of the vorticity fields indicated by 
vorticity magnituđe contours (solid lines) computed 
with the Euler equations (3.2), using the medium and 
fine meshes, and the shed vortex (dashed lines) that 
is fitted as a discontinuity to the surrounding 
Potential solution obtained by the 3D panel method. 
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An informative, and usually qualitative, manner to examine 
a flow over a wing is to look at its isograms on the wing 
surface. Figures 10(a-c) present such views together with the 
more quantitative graphs of spanwise distributions at three 
x/c= constant stations and compare them with the potential 
values. In the sets of computed isobars (Fig. 10a) the 
pressure trough under the leading-edge vortex has about the 
same shape, position, and width, and the three agree rather 
well. The peak level of the suction along the entire trough on 
the upper surface is somewhat lower in the medium-mesh Euler 
results, and shifted slightly inboard at x/c= 0.3, but the 
fine-mesh results show a trend towards the potential solution. 
The largest losses in total pressure coefficient (Fig. 10b), 
even negative values, are found near the apex. The fine-mesh 
results show a pronounced waviness that may be a reflection of 
the character of the vortex sheet as it approaches the trailing 
edge. This waviness, however, is not present in the 


circumferential (vw?) A velocity components on the upper 


surface (Fig. 10c). 
8. CONCLUDING REMARKS 


The pseudo-unsteady approach is an interesting one for 
solving the incompressible Euler equations because it is 
equally applicable in two or three dimensional problems and 
because its solutions provide a lot of insight into the types 
of flows that can occur. Whether the resulting hyperbolic 
system is strictly conservative, or only semiconservative, may 
influence how well a vortex sheet is captured. As a 
demonstration of the method, and as a benchmark solution for the 
testing of such methods a steady 3D flowfield with a free-shear 
layer has also been computed with a conservative system, and 
its features have been discussed under grid refinement. 
Comparison with an accepted solution was reasonable, even large 
(but local) errors in total pressure do not seriously degrade 
the global accuracy, and a curious vortex-like phenomenon 
coupled to a wave structure in the flow properties was seen to 
develop just ahead of the trailing edge. This feature may 
arise from a disturbance of the leading-edge sheet or it may be 
associated with the rollup of the trailing-edge sheet. 
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PARTICLE NUMERICAL MODELS IN FLUID DYNAMICS 


P.A. Raviart 
(Analyse Numérique, Université Pierre et Marie, Curie, Paris) 


l. INTRODUCTION 


By particle methods of solution of time-dependent problems 
we mean numerical methods where some of the dependent variables 
are approximated at each time t by a linear combination of 
Dirac measures in the space variables x. Although particle 
methods are far less popular than classical numerical methods 
such as finite difference or finite element methods, they play 
an increasing role in the numerical simulation of very complex 
flows where convection phenomena are dominant. In fact, vortex 
methods are now commonly used in the simulation of incompressible 
fluid flows at high Reynolds numbers. On the other hand, 
particle methods seem to be well suited to the computation of 
compressible multifluid flows. 


The purpose of this paper is to provide a fairly general 
introduction to the particle method which can be also presented 
as a generalized finite difference method using Lagrangian 
coordinates. The paper is organized as follows: Section 2 is 
devoted to the problem of approximation of continuous functions 
by linear combinations of Dirac measures (the particles). In 
Section 3, we consider general convection-diffusion problems. 
First, we show that the particle method provides a simple and 
effective way of dealing with the convection terms. Next, we 
introduce a deterministic method for approximating the 
diffusion term. In Section 4, we consider the Navier-Stokes 
equations for an incompressible viscous fluid. By applying the 
particle method to the vorticity equation, we introduce in a 
very natural manner the two-dimensional and three-dimensional 
point-vortex methods with a new treatment of the viscous term. 
In Section 5, we show how to adapt the particle method to the 
nonlinear hyperbolic system of gas dynamics. An extension of 
the method to a detonation problem is then discussed. 
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2. PARTICLE APPROXIMATION OF FUNCTIONS 


In order to introduce the particle method, we first need 
to discuss the representation of functions in Lagrangian 
coordinates. Given a velocity field mis u(x,t) = (a, (x,t) EAT 


u,(x,t)) yas (Xp e 1X) E€ Rt, t 2 O, we define the integral 
curves of the vector-field u, or equivalently the characteristic 
curves of the first-order differential operator 


a° 
ry Sala 


as the solutions of the differential system 


Q 


m= u(x,t). (2.1) 


d 
For all € € R , we denote by t > x(€,t) the solution of (2.1) 
which satisfies the initial condition 


x(0) =. (2.2) 
Setting 
ox, 
T(E,t) = det JA (E,t)), (2.3) 


a standard calculation shows that 
Chi 
pe (Et) = I(t) (Yeu) (x(E,t) ,t). (2.4) 
a 
Now, let g = g(x,t), xER , t 2 0, be a continuous function. 


For deriving a particle Spprowimation of the function g, we 


are given a quadrature formula over the space Be 


| W(E)GE = EF a, H(é.) (2.5) 
a er ees Si) 


where (85195) jeg is some set of points &, E rÊ and weights 


Ga > 0. By using the change ef variables % = x(£,t), we obtain 
for all continuous function ¢ = $(x) with compact support 


| g(xrt) > (x) ax -Í 9(x(E,t) -t)o(x(E,t))T(E,t) de . 
a d 


PARTICLE NUMERICAL MODELS 233 


Hence, ‘setting 


4, (E) = glg t) (2.6) 


and 
a, (t) ba T(E 57) ' (2.7) 


we have by (2.5) 


(t) t) (x, (t)). 


j 


| g(x,t) o(x)dx = £ ay (tg (x 
a ides: aay 


R jes 


In other words, we find that the function g is approximated by 
the following measure 


T glt) = E aj (Egla (tt) Slax, (t)) (2.8) 
je A A 


where ô(x) is the "delta-function", Observe that (2.8) is a 
Lagrangian representation of the function g: at each time 

t, g(+,t) is approximated by a linear combination of Dirac 
measures, or particles, which have fixed Lagrangian coordinates 


gy: 
On the other hand, given a set of particles j € J with 


positions x, (t) and weights ay (t), or equivalently the measure 


g' (x,t) = = by (t) 8 (rx, (t)), 
jeg aa 


we construct a smooth function in the following way. We 
introduce a sufficiently smooth cut-off function ¢ such that 


| % dx= 1. 
d 
R 


Then, for all € > O, we set 


1 
to (x) TAR &(x/e) 


and 


h 
ge(xrt) = Eb, (tye, (xx, (t)). 
pak Seton Win sense 
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This amounts to replacing each point-particle j by a finite- 
size particle whose shape is defined by the function cee 


By applying this procedure to the measure ates we obtain 
the function 


me Gert) = E a) (tigtx (t)). (2.9) 


(t) st) g (X-X. 
jeg 


j j 


In fact, g - me g may be viewed as some measure of the error 
between the function g and its particle approximation 1” g. 
Moreover, it is possible to derive bounds for g - D g in terms 


of the properties of the cut-off function Ç. Let us describe a 
typical result in that direction. We suppose for simplicity 
that 


a a d 
g=2Z, & = (j, h) 1% =h VWI= (jL,---i,)E2- 
zj å  lgisd’' `j 1 a (2.10) 
Then, we have (see(Raviart, 1983), (Gallic and Raviart, 1985a) 
for more details) :- 


Proposition 1. Assume that the velocity field u and the 


function g are smooth enough and that (2.10) holds. Assume 
in addition that the cut-off function ¢ satisfies the following 


Properties: 
(i) there exists an integer k z 1 such that 


a 
R 


By raba a 
4 a bax =O VB= (Bir. /BgEN, Ls|B|s k ~ 1; 


zi (2.11) 


(di) the function ¢ belongs to the space wrtsrt (24) for some 
integers m > d and s 2 0. 


Then we find for al, B€ N° with |8| =s 


a gem g) (x,t) = o(e* + ni/e™S (2.12) 
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In the above result, weet (pd) denotes the Sobolev space 


werent) = tye third; ye wtiR), lels) 


and we have set, if B = (By re++ Ba) E nå, 


lel 
= Big 2 
|B| = By +... + By ra =| B 


1 da 
ox, eee ox 


It follows from (2.12) that the interparticle distance h must 
be chosen much smaller than the regularization parameter £ in 
order to obtain satisfactory approximation results. 


3. PARTICLE SOLUTION OF CONVECTION-DIFFUSION EQUATIONS. 
Let us apply the above considerations to the numerical 


solution of convection-diffusion equations. We begin by 
considering convection problems of the form 


ə d 
ze ttug = g(xt.o), XER, t>0 
(3.1) 


where ġ = arere rth is a p-vector, g is a continuous function 


from Ro AIRO RP into RP ana V- (u ¢) stands for 


(Yela Oy) reeer¥e( O,))- 


We are looking for an approximate particle solution of (3.1) 
of the form 


B = 3 ; 3.2 
$ (x,t) = aj (E) g(t) 6 (x x (6) (3.2) 
where $5 (t) is some approximation of (x, (t),t). We first 


notice that we have 


E] h h d 
Sete eta a S aeg (ED Slet) (3.3) 
FEE gag E j 


& 
(aa 
te, 
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in the sense of distributions on Ro x ]o,o1. Indeed, let p be 


a Cc» function with compact support in Rt x ]o,-[ , we have by 
(2.1) 


da 
Fe lEt) t) = GE + wavy (Get) 


so that 


EJ 
=- E a, (tg, (t) (St + u. Yy) (x, (t) ,t)at = 
E ai Cie aton “he 


a a 
a z a, (t)o.(t) = wx, (t) ,t) dt 
j jes Eilat at j 
and (3,3) follows. Next, by the results of Section 2, we may 
write 
g(x,to(x,t)) = E ay (t) g(x (t) rt g(x, (t)ô(x-x, (t)). 
iie A jea a. wind ~ sj 
(3.4) 
Therefore, using (3.3) and (3.4), a (semi-) discrete form of 
Problem (3.1) consists in finding the functions t + $5 (t), jE, 
solutions of 


a 
at (a, $) = aj GH5rtrgs) 

(3.5) 
$, (0) s $, (85) Sew 


On the other hand, using (2,1), (2.2), (2.4), (2.6) and (2.7), 
we note that the functions t + x (t) and t + a, (t) are the 


solutions of the differential equations 
a 
| at 74 ~ uljet), 
(3.6) 
| IR 
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and 


a 
ae a = a, (Yeu) (x, ,t) 


j 
(3.7) 


a, (0) = aj. 


The particle method is thus defined by the equations (3.5), 
(3.6) and (3.7). It remains however to introduce suitable 
time-stepping in order to obtain a practically implementable 
scheme, 


Remark 3.1 Note that we need not solve (3.7) in the case of a 
purely convective problem (g = ©) since the first equation 
(3.5) reduces to 


a 
rs (a, ty?) =0. 


This means that Problem (3.1) is solved by moving the particles 
along the characteristic curves without changing their weights 


aj tol). 


On the other hand, in the case Y+u = O which occurs 
frequently in practice, (3.7) is trivially solved by setting 
a,(t) =a,. 


j j 


Remark 3.2 An equivalent way of deriving our scheme consists 
in noticing that (2.4) and (2.7) imply 


la 1 Bae 
mC a TL lug a A a A 


This amounts to writing first equation (3.1) in the Lagrangian 
coordinates (€,t) and using collocation method at the points 


gr Jeg. 
Remark 3.3 Consider the linear case where 
glxt,$) = G(x,t)¢ 


and G(x,t) is a p x p matrix. Then, it is a simple matter to 


check that our particle approximation li is indeed the unique 
measure solution of 
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Feet rca gh) = goute” 


$ (x,0) = Z a, fol? (x8) 


Hence, in the linear case, the’ particle approximation of 
Problem (3.1) is an exact solution of the convection equation 
corresponding to a perturbed initial condition. 


As we have noticed above, the particle method provides an 
easy and natural way of solving convection problems. We now 
turn to convection-diffusion problems of the form 


FETT N -VAgE glxtep, ERY t> o 
¢(x,0) = g(x) , (3.9) 


where the diffusion coefficient v > O is assumed to be small. 
Clearly, the difficulty stems from the particle treatment of 
the diffusion term. A first procedure proposed by Chorin (1973) 
uses a random walk method for solving the diffusion equation 


3 
ae $ 
and a splitting algorithm. We propose in this paper a 


deterministic method which is based on the notion of generalized 
finite-difference. 


- vAd=0 


Given a sufficiently smooth function Ro > R, we want to 
find an approximation of v(x.) in terms of the values vGq) of 


y at the particle positions Xk In fact, using a Taylor 


expansion at the point X50 one can prove 


‘Propgsition 2. (Gallic and Raviart, 1985b). Assume the 
hypotheses of Proposition 1 with s = 1. Assume in addition 
that the cut-off function 7 is radially symmetric, i.e., there 
exists a function  : R, > R such that 


t= gix. (3.10) 
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Then, if the function y is sufficiently smooth, we have 


DE (TR) e T g) 
)) E LE ES Sill gi 


PA ® a Yx) - w(x 
S ~j EETA 
kEJ = 
lx. - 5! 
(3.11) 
= Ay(x)) + o(e* + Koena 


In; (2s Dg (x) is the total derivative of the function 
p at the point x so that 


a ace a 
Dg. (x).y = 2 >= (x) y,, YER. 
een ws ie 1 Xs ~ i = 


Remark 3.4 One can slightly extend the generalized difference 
operator appearing in (3.11) in order to approximate any 
second order differential operator of the form V. (v V). 
Assuming the hypotheses of Proposition 2, we obtain ~ 


m, m+l 


= Ve (b Ww) (x) + ole + ntye™, | 


3 


Now, using (3.8) and Proposition 2, it is a simple matter to 
adapt the particle method to the convection-diffusion problem 


(3.9). Looking again for an approximate solution g” of the 
form (3.2), we define the functions t > $j (t), j E J, to be the 


solutions of the differential system 


Dg (x,7x,_) (x,_-x,) 

Aa ea aa E eee 
A et A 
m7 Hy 


miaj glžjrt;), 
(3.12) 
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The numerical simulation of the diffusion term is thus obtained 
by introducing interactions of particles. 


If we assume g = O, we note again that the convection terms 
are taken into account by moving the particles without changing 
their weights aj $ j: On the contrary, the particle treatment 


of the diffusion term consists in modifying the weights of the 
particles without changing their positions žy 


Remark 3.5. It may seem natural to choose for the cut-off 
function ¢ the Gaussian function 
2 
x 
o(x)= — ae exp(- dg ye (3.13) 
(27) 


This leads to a simple form for the first equation (3.12). 
Indeed, since in that case 


, 


2 
es x 
Dg (x)-y = —— r Xy exp (- ) 
€ (ano? gate * y E 


we obtain 


Ix al 
5) o- are ere 85 E kT #5) OP Bs ) = 


= g(x, rt+$5)- 


Concerning the convergence of this particle method of 
approximation of the convection-diffusion problem (3.9), we 
refer again to (Gallic and Raviart, 1985a). Using Proposition 
2, the convergence is proved under the additional hypothesis 


d- 
a © <0 420 
together with the error estimate 


(p - gh) (xrt) = ole? + hyt 


See also (Cottet and Gallic, 1985a) for a closely related 
result when the cut-off function ¢ is given by (3.13). 
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4. APPLICATIONS TO VORTEX METHODS 
Let us now apply the particle methodology to fluid dynamics 
problems. In this Section, we consider the two-dimensional and 


three-dimensional Navier-Stokes equations for an incompressible 
fluid flow 


3 
Fe Ut (wu +yp-v Ausf, Pen aaa t SO 
(4.1) 


Veu = 0. (4.2) 


We supplement these equations with the following condition 
which means that the flow is uniform at infinity 


u(x,t) +u (t) , |x| >e (4.3) 
and an initial condition 
u(x,0) = u- (4.4) 


We begin by analyzing the two-dimensional case. We set 


ðu, du. 
w= ourl u = g-a (4.5) 
3 1 2 


and we assume for simplicity that curl £ =O. By taking the 
curl of Equation (4.1), we obtain the following equivalent 
formulation of Problem (4.1), (4.2), (4.3), (4.4): 


32 iy (uu) - v dw = 0 mE n tem O 
(4.6) 
w(x,0) = w(x) BeN Wy = curl wor 
curl u= w 
EERI >r 
(4.7) 
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Let us introduce the kernel 
1 


K(g) = ——> 
an|x|? 


(= Xr x). 


Then, it is well known that the equations (4.7) may be 
equivalently written in the form 


ule ,t) = u, (t) + Kxw(+,t). (4.8) 


Hence, we obtain a convection-diffusion problem (4.6) for the 
vorticity w but here the velocity field u is a function of w 
itself through the equations (4.7) or (4.8). 


The vortex method is then obtained by applying the particle 
method developed in Section 3 to the convection-diffusion 
problem (4.6). We are looking for an approximate vorticity of 
the form 


wt) = E a (tw 


6 (x-x, (t)). 4.9 
aes 5 4%) ET )) (4.9) 


Since the convolution operator K* does not operate on measures 


and therefore on we. ,t) , we need first to regularize it before 
using (4.8). Setting 


Ko R G (4.10) 
we define an approximate velocity field ue by 
h h 
uo (ert) = u(t) +K * w (t). (4.11) 
Since Tar = 0, we have 
a, (t) =a, (4.12) 


J 
and the vortex method is defined by the equations 


d x =u jt) sult 2 aw, Ko, - x) (4.13) 
€ 
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For the convergence of the two-dimensional vortex method 
in the case v = 0, i.e., for the incompressible Euler equations, 
we refer to (Beale and Majda,1982a and 1982b), (Cottet, 1982) 
and (Raviart, 1983). 


Remark 4.1. There are other ways of dealing with the 
diffusion term in the vortex method. For a discussion of this 
topic, see the survey of Leonard (1980). See also (Beale and 
Majda, 1981) for a mathematical analysis of Chorin's random 
walk method. 


Remark 4.2. One can choose other forms than (4.10) for the 
regularized kernel K_ which may be simpler to use in practice. 
For instance, one can take 


Kik) reece | Gaal SIP RN, 
“e~ 21x? + €°) paar a U 


On the other hand, instead of using (4.11), one can solve 
directly the equations (4.7) in order to define an approximate 
velocity field. If we introduce a stream function y such that 


u = curl yp = (ee eos 1 
~ ~ ox, Ox, 
the equations (4.7) become 
2 
Bete es g KE Rey E> 0; 
(4.15) 
| curl y(x,t) u(t) , |x| > 


Roughly speaking, the vortex in cell method consists in solving 
numerically (4.6) by the particle method and (4.15) by a 
classical finite-difference or finite-element method. For a 
precise description of the vortex in cell method, see again 
(Leonard, 1980). A proof of convergence of the method may be 
found in (Cottet, 1984). 


Let us give a simple interpretation of the particle 
approximation of the diffusion term in the vortex method (4.13), 
(4.14). Setting ta = n At, we consider the following time- 


stepping 
BFL'ESR n BT: A 
A EAS Atu (t) + A ak Yk aa x) > (4-16) 


244 RAVIART 


D% (x - Xk ). (x, ak i) 
n+l n n n (ahi = ~ os 
= + 2v At = (wo - w) . 
°5 GERi EAA E ie a fu 
(4.17) 


By choosing as in Remark 3.4 
at pi 2 
zx) = 5 exp (- llo 


the equation (4.17) becomes 
[att Me “ag 
uj = u + SE = a, (up ah) epl- —— ~) . 
TE rex J 2e 


Now, if we take 


E = V2W AE y (4.18) 
we obtain 
x2t1l n 
n+l ag Žij S 
j anv bt Z %% % exp ( a at a 
(4.19) 
| n+l ae? 
1 x 
tU- mvae I epl 4y At RF 


The fully discrete vortex method (4.16), (4.18) may be 
viewed as a very natural splitting algorithm which can be 
described as follows. We start from the particle approximation 


x n ee) 
des te a 6(x x) 


of w(-,t ). Then: 
n 


(i) the vortex particles are moved during the time interval 


(ty toe) according to the equations (4.16). This amounts to 


solving the incompressible Euler equations 
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aw 
ae t Ye (uw) =0 


u(+,t) = u (t) + K *w(-,t) 
by using the vortex method and a forward Euler time-stepping. 
We obtain a new vortex dirstribution 


Dios we 8(x - xy"). 
jeJ 


(ii) We solve exactly the diffusion problem 


am Sit 
| at v Ağ=0, 
< 
lxt) = 3 a w S(x - ae 
jE 
on thè interval (t, r toa: We find 
n+1)2 
s l n |x we | 
O(x thy) T ary be oe Deen nae 


(ii) we approximate the continuous function lta a) by a 


particle distribution 


= oF se (x - PE) 
jer 
using the new positions ae of the vortex particles. Instead 
of choosing 
pet att |? 
MEL ETEL a 1 n peers) ~ 
way 70h A PERE Di Cie eae av At 1 


we define Uri by the formula (4.19) which includes a small 


J 
correction term ensuring that the highly desirable conservation 


property 
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holds. 


(iv) Time is advanced by At and one goes back to the step (i) 
of the procedure. 


For the convergence of this splitting algorithm, we refer 
to (Cottet and Gallic, 1985b). 


We now turn to the three-dimensional case. We set 


w=VxXxu (4.20) 


and we assume again for simplicity that Yxf£=0Q. By taking 
the curl of Equation (4.1), we obtain the equivalent 
formulation of the Navier-Stokes problem (4.1) - (4.4): 


(ə 
FEL +V lu w-(Y Dy -vdw=o, XER, t>o 
(4.21) 
alx) =w D E E a 
Vxusw 
ME E iat 3 
ER ASE O 
Veu=o (4.22) 


u(x,t) >u (t) , |x] >e. 


ðu 
In (4.21), V u is the 3 a 3 matrix-valued function (Ea 


j 1si,js3 


If we introduce the matrix-kernel 


we obtain that the equations (4.22) may be replaced by 


u(-,t) = u (t) + Ke w(.,t). (4.23) 
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The three-dimensional vortex method is based on the 
formulation (4.21), (4.23). Note again that (4.21) isa 
convection-diffusion problem that we have already considered 
in Section 3. The numerical method is then defined as follows: 
We look for an approximate vortex distribution of the form 


Pt) = 2 aj w(t) (x = x, (t)). (4.24) 
jEJ 
Setting 
Ko = Keto (4.25) 
and 
h h 
=u tK *w su + I a, K(x- x,).w, , (4.26) 
iene SE R jE j xe aa) 


we define the vector-valued functions t > z(t) and t > g; (t) 


to be solutions of the differential systems 


a we 
ae 35 = Beet) (4.27) 
and 
a € 
ELR (y Y “2, 


The convergence of this three-dimensional vortex method has 
been recently established independently by Beale (1985) and 
Cottet (1985) again in the case v = 0. 


5. APPLICATION TO COMPRESSIBLE FLUID DYNAMICS 


The particle methodology can now be extended to the 
compressible gas dynamics equations 


20 w + Ve(p ww) + Yp=0 xer, t>o (1) 


" 
o 


2 e) + V-((p e + p)u) 
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In i(5 <2) je seer $ lal? where € is the specific internal 


energy and e is the specific total energy. In order to close 
the equations (5.1), we need to add an equation of state of 
the form 


p = p(p,é). (5.2) 


A particle-in-cell method of solution of the equations 
(5.1), (5.2) has been introduced by Harlow (1964) for the 
numerical computations of compressible multifluid flows. In 
this method, particles are used in order to simulate the 
convection terms and a grid is used for computing the pressure 
forces acting on the particles. Recently, Gingold and 
Monaghan (1983) have proposed a pure particle scheme, the 
smoothed particle hydrodynamics (S.P.H.) method where the grid 
is eliminated and the pressure forces are easily put into our 
particle framework. We refer to (Ovadia and Raviart, 1985) for 
a fairly general presentation. 


First setting 


p 
$= | eu (5.3) 
pe 
and 
o 
POs) 
£(o) = | : , LEigd, £(G) = (£,(g),-0-r£,(9)) » 
el a ED = (£,( veer Egg 
pô, 
ta (5.4) 
Pu; 


the gas dynamics equations (5.1) can be written in the form 


seo t+ V+ (u 9) +v.) = 0 (5.5) 
with 


u = u(>) = (pu)/p- (5.6) 
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Again, we look for an approximate particle solution of the 
form 


Ponti = T a, (t)¢, (t)8(x = x, (6). (5.7) 
LON sen 


The difficulty lies in the particle discretisation of the 
pressure term V-f(ġ). Given a smooth function p : Ro > R, we 
observe that, under the hypotheses of Proposition 1, we have 


ae 
Ey BR) FH) Tee Oe > 
i (5.8) 
fy kyo 
Te es mi ) 


Bi 


Now, using (5.8), we define the functions t > $5 (6) jE, 
to be solutions of the differential system 


a Ht j 
at 2) + Š 2 yee ek ee Elg TIS vo (x, x) = Q, jEJ. 


(5.9) 


Remark 5.1. Note that the form of the particle approximation 
of the pressure terms leads to a conservative method. In fact, 


assuming that 


a 
o(- x) = (x) , xER , 


we obtain 


which is precisely the required conservation property. 
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If we replace in (5.9) ¢ by its value (5.3) and if we set 


(ow), TAE eE ay aii Oye 


the system (5.9) gives 


a te 

at (a, P3) =0, 

a = 

at (a, Gabel og Oe a a, (Py, + BS) VE, (x, -Xx =O, (5.10) 


d . aa = 
ac (a, Pj e;) + aj £ ak (Py Y&T P, uy) Yee (x, xX) = 0. 


The first equation (5.10) simply means that the mass of the 
particle j 


m, =a, Pp, (5.11) 


is independent of t. Then, the other equations (5.10) can be 
equivalently written in the form 


a 
Z u, + y =- =0, Beil 
ae 34 a (Pp, P,) Wo, (x, x.) = 0 ( ) 
a 
ae Se (p, a PAR: AA te, = C= Os 
at “3” pes 95 Pk ete By By be eT Ae 


(5.13) 


Hence each particle j has a mass m,, a velocity u, and a 


j j 
specific total energy as Its motion is defined by 


ao 
at ~j 
and it satisfies the equations (5.12), (5.13) together with 


j 14 
= (5.14) 


= p( ) €.=e eR 
pj = ple) + ej =e ily 


K 
j j j s] 


. (5.15) 
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It remains only to specify si or equivalently p,; we take 
J 


= = - x). : 
SP e (5.16) 


Remark 5.2. One can also derive slightly different particle 
approximations of the momentum equations and the energy 
equation which can be easier to use in practical computations. 
For instance, one can replace (5.13) by 


=e, + I (p, u tp AA D (x, = )=0. 

yes 95 įk 8 79 a Sk) oe 5 7 
Together with (5.12), this gives a very simple equation for the 
internal energy 


2 5 E(w) ETA (x, EE D 
E Ae E A AA A E e AA 


The numerical method is completed by introducing a leap-frog 
time-stepping and an artificial viscosity technique to avoid 
nonlinear instabilities. For details, see again (Gingold and 
Monaghan, 1983) and (Ovadia and Raviart, 1985). Numerical 
experiments show that the method is robust and accuract for a 
reasonable number of particles. 


Let us point out that putting more physics in the numerical 
model is generally an easy matter with the particle method. 
As an example, following (Bourgeade, 1985), we consider a 
combustion problem where, for simplicity, we neglect heat 
conduction. This amounts to add to the equations (5.1) a 
chemical reaction equation of the form 


3 2 
ge (02) +Y (pz = pr , (5.17) 


where Z is the mass fraction of unburnt gas and r = r(p,p,Z) is 
the reaction rate. In order to close the system of equations 
(5.1), (5.17), we usually need an equation of state of the form 


e€ = €(p,D,Z)- 


In practice, such an equation of state is frequently unknown. 
In fact, the knowledge of the equations of state of the unburnt 
gas and the burnt gas alone enable us to use the particle 
method. 
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We consider the fluid as a mixture of unburnt (2, = 1) and 


burnt particles (Z, = 0) and we set 


j 
h m, 
Z(xt) = E 2, t (x - x, (t)). (5.18) 
jeg Pj J a 


Next, we use a random method for describing the combustion 
process. At time jal the probability for an unburnt particle 


j to burn between a and ti Ne is approximated by 


1 


n 


At rle 


ee ENE: -Wh +} 
1 Pje z (257 t,))- 
Using this "probability law", we decide at random if the 
particle j burns, we add to its internal energy the energy 
liberated by the combustion process. We thus obtain a 
numerical method which is easy to implement. 
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COMPRESSIBLE LAGRANGIAN HYDRODYNAMICS WITHOUT LAGRANGIAN CELLS 


R.A. Clark 
(Computational Physics Group X-7, 
Los Alamos National Laboratory) 


1. INTRODUCTION 


Traditional Lagrangian hydrodynamic codes for time dependent, 
compressible, multimaterial problems in two dimensions use the 
same general method. A Lagrangian mesh is defined which moves 
with the fluid and this mesh defines a set of Lagrangian cells. 
The mass in each cell remains fixed and the motion of the mesh 
determines the volume and hence the density of each cell. 

These methods work well until the mesh becomes distorted due to 
shear or turbulence. Large distortions cause computer codes to 
quickly grind to a halt. 


The usual solution to distortion is to "rezone" the mesh. 
Here we move the mesh points artificially so as to reduce 
distortions and then map the quantities from the old to the 
new. This results in unwanted diffusion of mass, momentum and 
energy throughout the mesh. Even with rezoning, few Lagrangian 
codes can handle more than limited distortions. Recently, 
what we call "Free-Lagrangian" codes have been developed 
specifically to handle large distortions. These codes, in 
addition to adjusting the mesh points, can reconnect mesh 
points, thus creating new cells. While Free-Lagrangian codes 
can handle virtually any distortion, they are even more 
diffusive than rezoners. 


We are trying a different approach to the problem. We 
abandon the idea of Lagrangian cells entirely. In the next 
section we will discuss how the conservation equations can be 
solved directly without resorting to Lagrangian cells. Next 
we will give some examples of calculations using this method. 
Finally, we will give details of the calculational method 


presently being used. 
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2. SOLVING THE CONSERVATION EQUATIONS 


The equations we are trying to solve can be written 


D r= D > 

Bee = př. (2.1) 

poe 7 eens 

2 t= ah (2.2) 

De: SiE > 

ae a i (2.3) 
P = P(p,e) (2.4) 


where Ù represents the vector velocity, p the density, e the 
specific internal energy and P the pressure of the fluid. 
Equation (2.1) expresses conservation of mass, (2.2) 
conservation of momentum and (2.3) conservation of energy. 

The Lagrangian time derivative, i.e., the derivative following 


the fluid, is indicated by = $ 


In a standard Lagrangian calculation only equation (2.2), 
the momentum equation is solved directly. The procedure is to 
integrate (2.2) over some region of space to arrive at the 
acceleration of each mesh point. The mesh points are then 
moved and the new cell volumes along with the fixed cell mass 
determine the new density, hence indirectly solving equation 
(2.1). The associated Pdv work term updates the cell energy 
and indirectly solves equation (2.3) and the new pressure is 
obtained from the equation of state (2.4). 


We propose the following: Instead of Lagrangian, cells, we 
think of a set of Lagrangian points which are embedded in and 
move with the fluid. There is no mass associated with these 
points. They are just moving tracer points at which we will 
attempt to keep track of the velocity, density, energy, and 
pressure of the fluid. In our later example calculations we 
will show point positions at various times in the calculation. 
At each of these points, we know the density, energy and 
velocity of the fluid, but we do not associate any particular 
mass with the point. 


Looking now at equation (2.1), we note that to approximate 
the time integral of the density change from time t to time 
t + ôt we need an approximation to - U at that point. To 
solve equation (2.2) we need an approximation for VP, and for 
(2.3) we again need . U. To obtain these we select a set of 
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"representative" neighbours. We then make a finite difference 
2 > > > 

approximation to VP and V . U using these neighbours, and 
> 

update p, U and e at each point. Each point is then moved the 


> 
distance U ôt and one time step is completed. 


At the next time step the selection of a set of 
"representative" neighbours may change, but this does not 
require any sort of re-mapping of variables. It only means 
that a different set of points will be used in the next finite 


difference approximation to q . Ù and Ye. Large distortions 
in the flow will produce frequent changes in neighbour 
selection, but since there are no cells to distort and no 
re-mapping to be done the calculation proceeds from cycle to 
cycle with no difficulty. 


3. SOME EXAMPLE CALCULATIONS 
3.1 Spherical example 


Here we will give three examples of calculations 
performed by the code HOBO using the free Lagrangian method 
described herein. In the first test problem, the initial 
condition is a sphere of perfect gas with a gamma of 5/3. The 
gas is divided into four regions as seen in Fig. 3.1. Pressures 
are in megabars, density in gm/cc and dimension in cm. 
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The high pressure in region IV will drive a spherical 
implosion which will greatly compress region III, II, and 
particularly I. There are two challenges to this problem, the 
first is to maintain a spherical ball while running the 
calculation in cylindrical (r,Z) geometry. Six snapshots of 
region II are shown in Fig. 3.2. Region I is interior to 
region II. The minimum volume of region I occurs in the fifth 
snapshot after which region I begins to expand. We ran 1000 
calculational cycles with 73 points in the radial direction and 
64 points covering 180° of angle. The left half of the 
snapshot is a reflection of the right half which was calculated. 


0 wlan, 
\ My 


“my ww 
Mn 


t=0.0 t=.049 t=.099 t=.150 t=.185 t=.289 
Fig. 3.2 


The second challenge is the accuracy of the solution. For 
comparison purposes we ran a standard one-dimension Lagrangian 
code using 800 zones, 100 zones in each region. In Figs. 
3.3a, b, c, and d we have plotted the average density and 
average specific internal energy in regions I and II as 
calculated by HOBO with 73 points in the radial direction and 
the one-dimensional Lagrangian calculation with 800 points. We 
feel the agreement to be quite good. One notable difference is 
the time at which minimum volume is reached. HOBO is slow by 
about .0075 usec or 4% of the problem time at that point. 

Since average density and energy are integral quantities we 
have plotted one of the variables as a function of radius in 
Fig. 3.4. We chose radial velocity, but the agreement in all 
other variables is very similar. The plots are from slightly 
different times to compensate for the time shift just 
mentioned. The 1D Lagrange plot is at 2.125 usec and the HOBO 
plot is from 2.25 usec. Apart from the inability of the more 
coarsely zone HOBO to resolve the shock front at the radius 

1.2 cm we feel the agreement is excellent. The time chosen for 
the plot is late in the calculation when region II has expanded 
almost back to its original volume. 


LAGRANGIAN HYDRODYNAMICS WITHOUT CELLS 259 


a- REGION 1 3005 REGION 2 
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000 005 o da5: 020, T03 TIME (MICROSEC) 


Figs. 3.3 a, b, c andd 
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3.2 Meshkov Instability 


For our second test problem we have chosen a Meshkov 
instability based on the geometry used in one of Meshkov's 
experiments. The initial conditions are shown in Fig. 3.5. A 
piston driven shock is driven through a region of air and then 
helium. The air to helium density ratio is just over 7. There 
is an initial perturbation in the air -- He interface which 
grows with time after the shock passes through the interface. 
In Fig. 3.6 we plot several snapshots of the Lagrangian point 
positions in the air (the He is not plotted). For comparison 
purposes we ran the same problem in a two-dimensional Eulerian 
code with the cell size similar to the point separation used in 
HOBO. In Figs. 3.7 a and b, we compare the size of the 
perturbation as it grows in time. In 3.7 a the initial 
perturbation, 6, is .2 cm and in 3.7.b it is .4 cm in width. 
The agreement between the two codes is excellent. 


33.8 cm 12.675 cm 16.9 cm 


air y=1.4 air y= 1.4 He y= 1.63 
P= 1.8289 P = 1.01325 P = 1.01325 


Q= 1.795 x 10° o = 1.184 x 10? \ o = 1.664x 107 
U= 15.31 u=0 u=0 


Fig. 3.5 
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Fig. 3.6 
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Fig. 3.7 a and b 


3.3 Rod Penetration Problem 


Our third test problem is the penetration of a concrete 
plate by a steel rod moving at an initial velocity of 
.2134 cm/usec. The rod is 9.066 cm in diameter and 45 cm in 
length. The concrete is 50 cm thick. In Fig. 3.8 we show 
six snapshots of the rod penetrating the concrete. 
Incompressible theory (Birkhoff and Farantonello, 1957) 
predicts a constant time rate of change in the length of the 
steel rod. The sound speed in the rod is .4545 cm/usec and 


EN = .22, so this problem should not be too far from the 
incompressible solution. As shown in Fig. 3.9, the rod length 
as a function of time matches the incompressible theory very 
well. Calculations with a two-dimensional Eulerian code 
produced an almost identical result. 
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4. THE FINITE DIFFERENCE SCHEME 


4.1 The pressure gradient 


= 
We want to approximate VP at the point k where neighbours are 


the points kyr ky eee eee Our neighbour selection 


guarantees at least three neighbours for each point, the 
average is six and there is no maximum number. Clearly there 


are many methods that could be used to approximate Ve. The 
following was arrived at through trial and error and appears 


to work very well. 


ek: ok, 
P: P, 
PLS P, P, PLS 
Pa 
ok. 
Fig. 4.1 


Consider point k in Fig. 4.1 that has five neighbours. We 
construct a polygon with vertices midway between the point and 
each of its neighbours. The position of the nth vertex is 


Z = 1/2 (xk) + x(k,)] and th from %, to x. i 
x x(k) + x( a) an e vector from x, to x, is 


> = > 
denoted by 6x, te a x The pressure at the nth vertex, Pa’ 
is a weighted average of Pik) and P (ka) (to be described in 


section 4.3). We assume a linear pressure distribution along 
each edge of the polygon and integrate the pressure over the 

> 
surface to get a force F. We assume a constant density 8k over 


> 
the polygon to calculate a mass M. Then we have os U = n° 


> > > 
Now let x'a =X, E cox, and the pressure at the new vertex 
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xX > 
is Pr = Bee E (Pa = PL) Now F and M are functions of € and 


we compute 


lim F(e 
e>0 M(e) ` 


The resulting expression for the pressure gradient is 


(4.1.1) 


where x and y are respectively the unit vectors in the x and y 


directions and ôx = ôx % + ôy y. 
n n n 


If the preceding is done in cylindrical geometry, the result 


is identical for vp with x and y replaced by r and z. It is 
of interest to note that if the iin is not taken the result 


does not give a spherically symmetric pressure gradient in a 
spherically symmetric problem using cylindrical coordinates. 


There is an easier way to arrive at equation (4.1), although 
the method just described is how we originally derived it. 
Since it takes only three points to describe a plane surface, 
each consecutive pair of neighbours along with the point k 
defines a pressure plane to first order. If we assign a 
weight to each of these approximations we have an approximation 


for Ve. If the weighting function is the area of the triangle 
formed by the three points, the result is the same as Eq. (4.1). 
We have tried other weighting functions, 9 and sinO where 0 is 


the angle between ZA and Cae and both work fairly well, but 


area weighting appears to be best at this time. 
4.2 The divergence of the velocity field 


In cartesian coor dinarna we represent the velocity at the 


point k by Ù = uy s = Vk z- The divergence of the velocity 
field can be SeSi as y . t= F x here V is the specific 
volume of the fluid. Referring back to Fig. 4.1 the specific 
volume of the constructed polygon is proportional to the area of 
the polygon given by 
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A= 1/2 a (x 


n+l i xa (Yaar Tee 


n 


Hence we can write 


q 3 a L OA Ps f a) (Yor i Yp? S T g *,) (Waar = vi) . 
r A dt a TTT a 


(4.2.1) 


a Xa+l Ya Yn+l Xa 


Equation (4.2.1) can be derived directly from equation 
(4.1.1) by noting that (4.1.1) implies a definition for the 
> > 
operators a and = and when these are applied to V . U = 2 
+ = equation 4.2.1 is obtained. Thus we have in effect three 


ways of deriving the same finite difference approximation to 


the operators a and a . In cylindrical coordinates we 


dy 
express the divergence of the velocity field as 
a a ae a eR E N 
z ox ə rer dz 


ð 
where = i x is calculated by equation (4.2.1) with x,y 


replaced by r,Z. 
4.3 The midpoint pressure and velocity 


In equation 4.1.1 we use a pressure Pa which is midway 
between points k and ka This is not a numerical average. 


Consider the one-dimensional problem depicted in Fig. 4.2.a. 


What pressure should we use for pi = Pei? If we use the 
average, 1/2 (Py + P,) the acceleration at i+l will be much 


greater than at i. However, we know that the velocity should be 
Continuous across the discontinuity. Given equal zoning the 
boundary pressure which gives equal acceleration to points i and 


A + 
itl is P, = (P; Ping * Pag? y)/ (0, + Psa)? 


It can be shown that the resulting finite difference 
approximation ay = (N = A) /8x is second order accurate when the 


density is continuous. 
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Fig. 4.2.b 
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Now consider the problem depicted in Fig. 4.2.b. Here we 
have a heavy material on the left moving into a very light 
4 = 
material on the right. What should we use for Uj = Uia? z£ 
we use the average 1/2 (U + Usa)? there will be a very large 


rate of compression in region 2 which is incorrect because 
region 1 is moving into a near vaccum. The quantity that 
should be continuous is pressure. The velocity which causes 


equal pressure increases at points i and itl is ut = 
2 2 2 2 7 
[ (pe put (pe bei a lA (pe i + (pe del: This assumes 


the sound speed c is a constant. Again it can be shown that 
the resultant finite difference approximation to ui is second 


order accurate if pë? is continuous. 


The midpoint pressures used in equation (4.1.1) are inverse 
density weighted and the midpoint velocities in equation (4.2.1) 


are pee weighted. 
4.4 The artificial viscosity 
An artificial visocsity q is added to the midpoint 


pressure in equation (4.1.1). It is quadratic in space form. 
Let U, be the closing rate between points k and ka’ i.e. 


c & HR 

7) 

te Ee > n 

|p = (Uy = Uk Fk z> . 
n ke Tk 

n 

Then let q, = A u and q, = ace u2 In the spirit of 
k kc ka Ee R 


paragraph 4.3, we inverse density weight the two to get our 
expression for the midpoint q, i.e. 


sper) 
= 2a Uy / a/p% + Me . (4.4.1) 


In all of our example calculations in section 2 we used 
a = 5.76. Now we must fold q into the internal energy 
equation in which we need to evaluate (P + q)V¥.U. Our 


a 
approximation for v.0 is given by equation (4.2.1). The q 
term is brought inside the summation so that 
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L(P, + = - 
Blige cnn men Wenn AER Sn) nad naa) 


= = 
*at1 Yn Yn+1 “n 


(p+q)V.U = 


(4.4.2) 
4.5 Prevention of density striations 


The method so far described has one remaining difficulty. 
By having all of the variables centred in space it becomes 
impossible to detect a sawtooth type wave as depicted in one 
dimension in Fig. 4.3. 


*i-2 Xie z Xi+1 Xi+2 


Fig. 4.3 


If such a wave develops it cannot be detected by a centred 
difference scheme. To correct for this, we define an 
artificial velocity u' as depicted in Fig. 4.3. We use our 


calculated v P to extrapolate from point k to point ki giving 


pt _p + (k -X).VP,. If the pressure field is linear 
= = . . re 
k aane Py k ere ae ji 
n n 
ext 
then Pk =P_. If they are not equal, there is a second 
n n 


derivative in the pressure field which we attempt to reduce. 
Physically what should happen is a velocity would be produced 
at the midpoint as indicated in 4.3., which would be produced 
point i and compress point i+l. This velocity must be 

2 
proportioned to ôP = pow P, . We chose to use u' = b 6P/pc. 


k 
n n 
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We then use pee weighting between points k and Ka to arrive at 
2 
ee 
b ôP ley ka 
me (4.4) 


ut is added to a, in calculating Vv.U. 


2 
In our present calculations b = 1.44. We further limit 
[un] to be less than 20% of the maximum of (C, r C} A o 


practice u' is a very small term but an absolutely necessary 
one. For example, in test problem 1 density striations of 
around 50% will occur without using u'. We note also that ôP 


is proportional to Sx P y and thus u' is quadratic in 


nature. The similarity between q and u' is striking. The q is 
an artificial pressure which smooths the velocity field while 
u' is an artificial velocity which smooths the pressure field. 


4.6 Neighbour selection 


The method requires a good selection of representative 
neighbours at each point in time. We have found out that the 
neighbours whose bisectors form the Voronoi polygon around the 
point k are an excellent choice. The Ken Voronoi (Voronoi, 


1908) polygon is defined as that region of space which is 
nearer point k than any other point. 


5. SUMMARY 


The partial differential equations (2.1), (2.2) and (2.3) 
along with the equation of state (2.4), which describe the time 
evolution of compressible fluid flow can be solved without use . 
of a Lagrangian mesh. The method follows embedded fluid 


points and uses finite difference approximations to We and 


7 7 ù to update p, ù and e. We have demonstrated that the 
method can accurately calculate highly distorted flows without 
difficulty. The finite difference approximations are not 
unique and improvements may be found in the near future. The 
neighbour selection is not unique, but the one being used at 
present appears to do an excellent job. The method could be 
directly extended to three dimensions. One drawback to the 
method is its failure to explicitly conserve mass, momentum and 
energy. In fact, at any given time the mass is not defined. 
We must perform an auxiliary calculation by integrating the 
density field over space to obtain mass, energy and momentum. 
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However, in all cases where we have done this, we have found 
the drift in these quantities to be no more than a few present. 
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APPLICATION OF DISCRETE VORTEX METHODS TO THE 
COMPUTATION OF SEPARATED FLOWS 


J.M.R. Graham 
(Department of Aeronautics, Imperial College, London) 


1. INTRODUCTION 


In its simplest form the discrete vortex method uses 
sequences of discrete vortices to represent a cross-section 
of a two-dimensional vortex sheet which in turn may be viewed 
as the infinite Reynolds number limit of a free shear layer. The 
velocity field of these inviscid "point" vortices is used to 
convect the vortices and hence the evolution of the sheet is 
calculated by a step by step forward time integration. A 
great deal has been published on this subject and the present 
paper will concentrate on only one aspect, the modelling of 
separated low speed flows around two-dimensional bodies. In 
this case the shed vortex sheets are usually assumed to be 
two-dimensional also, although in many cases that is only 
true in the mean. A comprehensive review of the simulation 
of three-dimensional vortex flows has been given recently 
by Leonard (1985). These simulations have been mainly 
concerned with isolated free vorticity (eg. the evolution of 
vortex rings or transitional boundary-layer disturbances where 
major wall separation is not involved). Three-dimensional 
vortex sheet separation from bluff bodies has been studied much 
less than two-dimensional because of the greatly increased 
computing requirement. Rehbach's (1978) work and Billet's 
(1980) calculation of the flow over a swept and tapered step 
are examples. 


Application of the discrete vortex method to the simulation 
of flow around two-dimensional bluff bodies was largely 
initiated by Clements' (1973) modelling of separated flow past 
a semi-infinite bluff body and Gerrard's (1967) more approximate 
model for the wake of a circular cylinder. In addition to open 
separations where two shear layers on either side of a wake 
interact, the discrete vortex method has been used to study 
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closed separations in which a separated shear layer reattaches 
to the body surface (eg. Kiya et al. (1982), Ashurst (1979a)). 
Simulation of this type of flow has much in common with the 
former but will be considered in less detail in this paper. 


In the case of two-dimensional (planar) low speed flows 
the vorticity convection equation: 


2 
ew dw d w 
w tU a FV (1.1) 
t j ð ax, ə. 
a j z z ES 


shows that the vorticity w(=VAu) is convected with the fluid 
particles. In an inviscid flow (v=0) the vorticity of each 
fluid particle therefore remains unchanged with time, and for 
an irrotational free stream can only be generated by pressure 
gradients acting at the boundaries of the fluid. Whether the 
flow is treated as inviscid or viscous, vortex methods replace 
the convective left-hand side of equation (1.1) by the 
Lagrangian equivalent: 


= U(z) (1.2) 


where z is the position vector, (xtiy) in complex notation, of 
a given particle having vorticity w; U is the corresponding 
(complex) velocity. In a two-dimensional flow the vorticity is 
related to the stream-function by the Poisson equation 


vyp=-w. (1.3) 


2. REPRESENTATION OF VORTEX SHEETS 


In the absence of three-dimensional disturbances and 
viscous diffusion all the vorticity in the fluid lies on 
infinitesimally thin sheets. Some parts of these sheets 
may roll up forming core regions containing spirals of infinite 
length. But, these regions apart, the Biot-Savart integral 
solution of equation (1.3) can be evaluated over sheets 2 of 


finite length so that the velocity field may be written: 


Aya has Chie) 
u(z') 27 S z'* = Ze (2.1) 
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where T(z) is the circulation of a sheet from its end up to 
the point z, and * denotes complex conjugate. 


The evolution of the vortex sheets is obtained by numerical 
integration of equations (1.2) and (2.1). The simplest formula 
uses first order Euler integration of 1.2: 


z', =z, + U(z,)óôt Bis 
4) a (25) yeye 
Ar 
BAR k (2.3) 
with U(r.) = 5> kel Z% - 2s 
20 5 k 
J kj j 


= is the new value of ey after a time step 6t. The 25 j=l 


to N, are a set of discrete points covering the vortex sheets, 
each at the centroid of a small section of sheet having 


circulation Ar The AT are conventionally thought of and 


depicted as discrete point vortices attached to fluid 
particles, hence the name of the method. 


Equation (2.2) is first order accurate in ôt. Greater 
accuracy may be obtained by using, for example, a Runge-Kutta 
multistep integration procedure. But the accuracy with which 
(2.3) represents (2.1) depends on the distribution of the z, 
along the sheets. J 


Rosenhead (1931) and later Westwater (1935), for the roll- 
up of a trailing vortex sheet, used a small number N of the 
Zj points on the sheet. They found that in each case the 


sheet rolled up in a fairly orderly way. Later work by Moore 
(1974) using a closer spacing of the zj led to rapidly 


developing chaos in the structure of the sheet. Fink and Soh 
(1978) suggested that this could be prevented by ređiscretising 
the sheet at each time step to retain a uniform distribution of 
the convected 25 and therefore a more accurate representation 


of the principal value integral. However Moore (1979) has 
shown that although rediscretization slows the development of 
instability it does not remove it since all such discrete 
representations of a vortex sheet of zero thickness are 
pathologically susceptible to the Helmholz instability. 

van de Vooren (1980) has given an analysis of the singular 
integral in (2.1) as a discrete sum including the effect of 
curvature of the vortex sheet. 
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The Helmholz instability may also be reduced by damping 
the rate of growth of the smallest wavelengths. Chorin (1973) 
considered each discrete vortex in the discretization of the 
sheet as having an independently diffusing viscous core. 
This scheme, which Chorin has shown to give a representation 
of viscous diffusion, does also stabilise discrete vortex 
models of vortex sheets. Other workers have used constant finite 
core vortices for the latter purpose only. Leonard (1980) 
gives a detailed discussion of the representation of diffused 
regions of vorticity. 


It is also possible to carry out the integration of (2.1) 
along the sheet by considering piecewise constant (or linear) 
distributions of vorticity over short (usually straight) 
elements of sheet, eg. Faltinsen and Pettersen (1983), Basu 
and Hancock (1978), Higdon and Pozrikidis (1985) (arc elements) 
and Pullin (1978). The last of these used a circulation 
Parameter as the variable of integration along the sheet 
following Smith's (1968) technique for the integration of the 
steady sheet from the leading edge of a slender wing. It 
appears from these results that the higher the order of the 
representation and hence the smoother the resulting integral 
function along the sheet the smaller are the instabilities 
which develop in a given time interval. 


In the case of the representation of three-dimensional 
vortex sheets by line elements of infinitesimal core size 
a further problem arises due to the singularity in the 
self-induced velocity. The problem may be avoided by 
considering the sheet locally as a strip of curved surface 
rather than as a line, see eg. de Bernardinis et al. (1981). 
This will not be considered further here. 


3. IMPULSIVELY STARTED FLOW PAST A SHARP EDGE 


The local details of vortex shedding from a sharp edged 
body, where separation is observed to be fixed at the edge, 
can be investigated by considering the separating shear 
layer from a semi-infinite wedge (figure 1) of the same 
internal angle 6. It is convenient to transform the region 
exterior to the wedge into the right half (ç) plane by the 


conformal transformation: — = oli z 1/k where c is a 
length scale and k=2 - 6/m. The complex potential in the ¢ 
plane of a general unseparated flow at the edge has the form: 


w(t) = 4 vies + UT H.a. 


where only the first two terms corresponding to flow round and 
symmetrically off the edge are relevant here. As a result of 
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the onset at t = O of V velocity round the edge, neglecting 
viscous diffusion, separation occurs and a vortex sheet is 
shed into the fluid. 


The equation for the subsequent development of the sheet, 
z2=2(T,t) is 


azs _ aw] (3.1) 


In addition a condition of smooth separation from the 
edge must be applied. The correct application of this 
Kutta-Joukowski condition has been discussed by Giesing (1969). 
He showed that provided the sheet is modelled by a continuous 
sheet of vorticity in the vicinity of the edge, it must leave 
the edge tangentially to the windward side, but with curvature 
which may be infinite at the point of separation. The surface 
velocity Q is therefore finite, continuous and non-zero on 
the windward side and goes to zero as z*O on the lee side, 
except when the wedge angle 6=0. In order to achieve this at 


the singular point z = O, 3| = O in the ¢-plane. The total 


complex potential for the separated flow is W=W_+ Woe where 


Wp is the part due to the vortex sheet. Replacing the 


interaction of the vortex sheet with the body by an image 
sheet in the transformed plane which makes the imaginary axis 


a streamline of the flow: 


aa Font hae eye feo Jehan’ hace E 
az ðz 27i 3z c-2(T',t) a c+e*(T', t) 


o 
(3.2) 
The convection equation for the vortex sheet becomes 
EE naire hR r A ~1/k-1 2U(t) _2-2/k z2/k-1l Ki] 
ae aS v(t) z AE S 
is T, -T 
z 1/k-1 f o ar' -f o ar } ahs 
Qmik Uo glk zr) L/* o 2/5 zar) ik 


and the Kutta-Joukowski condition gives 
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(3.4) 
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NU 
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Fig. 1. Vortex sheet shed from a sharp edge. 
4. CONCENTRATED POINT VORTEX REKRESENTATION (FIG. 1) 


In the most simplified model of a spiral vortex sheet, the 
sheet is replaced by a concentrated point vortex r at Zo at 


the approximate 'centre' of the spiral joined to the shedding 
edge by a cut in the plane across which the velocity potential 
is discontinuous. The cut represents the effect of the sheet 
feeding vorticity from the edge to the growing vortex core. 
The equations governing the growth and motion of this vortex 
have been derived by Rott (1956) and Graham (1977) for 
unsteady separated flow, based on the original theory of 
Brown and Michael (1954) for steady separated flow over a 
slender wing. 


The zero total force (complex) equation 


225 25 Fo aw (4.1) 
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represents in integrated form the condition of continuity of 
pressure across the sheet. The Kutta-Joukowski condition at 
the edge is: 


ow 


ac 


n 
9 


(4.2). 
t=O 


These equations are dimensionally exactly the same as the 
corresponding equations for the continuous sheet which they 
replace. Impulsively started flow of the form U=0, 


v=ve" gives rise to a self-similar development of the vortex 


sheet, first investigated by Blendermann (1969). Equation 


ð! 
4.1 requires evaluation of the complex velocity = at z, 


ir S 
less the singular part EIA) . 
Thus Ik -1 
ir z 
W pic Yeye Uee ai eso 
3z k © k Zk |22 I1/k__*i/k 
22, Re say Be. 


(4.3) 


The second term is Routh's correction, arising from the 
distorting effects of the transformation on the vortex core. 
This correction is required in all discrete vortex 
calculations for which the velocity field is calculated in a 
transformed plane. 


Equations (4.1) and (4.2) have the solution 


Wes 20 ‘i 1/(2k-1) 1? et A (4.4) 
Soak 
_ — k/(2k-1) ið 
z Le z (4.5) 
where 6 = -k cos (5 vk ) (4.6) 
and _ V4-k (k-1) (2k-1) y (4.7) 
RIESS 


2k“ (3y k +k +1) 
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The length scale (the only one for this infinite wedge flow) 
is 


we (k-1) /(2k-1) k/(2k-1). 


L (vt) 


The constant of proportionality for the circulation Da is 


overestimated in comparison with more detailed representations, 
see Table 1. 


TABLE 1 


Impulsively started, constant velocity (y=0) 


Angle Blendermann|Fink and Soh;/Pullin|Multi-vortex|Brown and 
Michael 


On| 26s x2, 321! 2.418 2.398 2.38 2.493 
90° {us| 1.983 1.794 1.684] 1.70 1.923 


When the V-velocity dominates the flow as above the vortex is 
shed at an angle which is greater than 6 (the tangent to the 


windward edge) for all edge angles Tess than 138.5°. But in 
cases for which a large U-velocity is present it is shed 
along the external bisector. 


A defect in this model is the necessity to enforce a full 
stagnation (Kutta) condition in the transformed plane. This 
leads to an unrepresentative full stagnation point at 
separation in the physical plane for non-zero edge angles and 
departure of the separating streamline along the bisector 
rather than tangentially to the windward edge. However the 
great merit of the model is its simplicity which allows one 
to examine the behaviour of some vortex shedding problems 
approximately without extensive computation. 


If the vorticity from a given edge changes sign with time 
it may become necessary to identify new centres of vortex 
roll-up. Maskell's (1972) condition for a separating vortex 
sheet specifies that it will change direction at the same time 
at which its strength changes sign, from being parallel to 
one side of the_edge to being parallel to the other. This, 


suggests that soo is a suitable criterion for the shedding 
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ər. 
of a new point vortex core. Other criteria, such as an 


passing through a minimum may also be used. 


5. MULTI-VORTEX REPRESENTATION OF THE VORTEX SHEET SHED 
FROM AN EDGE 


Pullin (1978) analysed the development of the sheet 
described by equations (3.3) and (3.4) for impulsive flows 
round a sharp edge. He expressed (3.4) in self-similar form 
and then evaluated the integral numerically over the main 
part of the sheet, by a trapezoidal scheme. To avoid the 
need for an infinite set of elements the inner spiral part 
of the sheet was represented by a single concentrated core 
vortex, treated in exactly the same way as the isolated 
Brown and Michael vortex with a cut joining it to the end 
of the continuous part of the sheet. 


Most calculations of separated flow past bluff bodies 
have tended to use the simplest multi-point vortex rather 
than continuous element representation of the sheet. This 
is more convenient and faster to compute but gives a less 
accurate and less smooth velocity field. It is also, like 
the continuous representation, limited to the outer part of 
a spiral sheet if an orderly roll-up is to be obtained. In the 
inner part of a spiral it is essential for stability to 
maintain a smaller separation between succeeding vortices 
along the sheet than the gap between turns of the spiral 
(Moore, 1974). Therefore a concentrated inner core vortex 
should again be used. This is often not done because of the 
difficulty in general calculations of identifying new centres 
of roll-up as they form. The result is structureless clouds 
of point vortices which are or may be justified by the turbulent 
diffusion which would be present in reality at high Reynolds 
numbers. 


The vortex sheet y leaves the edge tangentially to the 
s 


wetted surface as shown in Fig. l, and except in the case of 
zero internal edge angle Yia Na and Y70 at the edge. Y2 


vorticity cannot be shed in inviscid flow at the edge at all 
but secondary separation on surface 2 may occur ahead of the 
edge releasing vorticity of opposite sign into the primary 
sheet. Giesing (1969) shows that a shed vortex sheet of 

this strength satisfies the usual stagnation point Kutta 
condition in the transformed plane in which the edge has been 
opened out into a plane surface. This removes the infinite 
part of the velocity in the physical plane. 
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If however the vortex sheet is replaced by a finite number 
of points and the Kutta condition is satisfied at the edge, 
a full stagnation point must exist in the physical plane with 
both Q and Q> O at the edge, except for the case ô=0. Many 


calculation methods therefore fix the strength and position of 
the last vortex shed by satisfying the Kutta condition and 
then obtaining an estimate of Q) either from the velocity near 


to but not at the edge (at which it is either O or ~) or from 
the convection velocity of the last vortex or vortices to 
have been shed, eg. Clements (1973). Phrases such as boundary 
layer thickness are often used in this context to define a 
point at which to evaluate Que but it is clear that no such 


length scale can be consistently defined for what is 
basically an inviscid model. The rapid variation of Que near 


the separation point (Figure 2) makes its direct estimation 
point vortex representation likely to be inaccurate. Use 

of the convection velocity (Sarpkaya (1975), Evans and Bloor 
(1976), and in effect Kiya et al (1982)) which varies less 
rapidly just off the edge is likely to be more accurate, 

but must be based on a correct assumption for the velocity of 
the flow_on the separated side of the sheet. 
6=45° 


. & 
Multi- o Edge sheet el 
vortex 9 


+ Single pt. vortex 


vortex sheet and surtace velocity distribution for 


Fig. 2. 
uniformly accelerating flow past a 45° edge. 
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Alternatively the release point may be specified, usually 
just above the desired separation point, and the vortex 
strength calculated by satisfying the Kutta condition at the 
separation point. Kamamoto and Bearman (1980) systematically 
studied the effect of such an artificially specified release 
point for the case of a normal flat plate in steady flow. 
They concluded that satisfactory results could be obtained 
for the shape and strength of the shed vortex sheet provided 
the non-dimensional ratio of the release height, free stream 
velocity and time step lay within a certain range. However 
this technique has been criticised for introducing an 
empirical parameter and for eliminating some of the feed back 
in the vortex shedding process. It is also possible, Graham 
(1977), to use the Brown and Michael equations to calculate 
the position and strength of each new vortex. Since the V 
velocity is suppressed by the previously shed vortices of 
the sheet, U-velocity dominates and the new vortex leaves 
along the bisector of the edge (8<90°) or at a more acute 
angle for larger edge angles. The predicted initial 
trajectories of the vortices leaving the edge are therefore 
incorrect and the local details of the flow field are wrong. 
However comparison with more accurate representations 
indicates that the strength of the sheet is given quite 
accurately and the shape of the sheet and velocity 
distribution on the body surface a small distance away from 
the edge are also accurately predicted (Figures 2 and 3). 


-Y (scaled) 


a $ Cloud-in-cell 


1 oN od (tS | 
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Local vortex sheet shape from various computations. 


Fig. 3. 
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Some of the difficulties associated with shedding 
point vortices to represent a continuous sheet may be avoided 
by using a continuous plane sheet element for the vorticity 
shed during the latest timestep. In reality this part of 
the sheet is curved with infinite curvature at the edge but 
a plane element is nevertheless sufficient to give the main 
components of the local flow field. Giesing (1969) has 
derived the equations relating the strength Ng of the sheet 


at the edge to the V velocity component round the edge. 
Assuming constant eg Over the short plane element of length s, 


the Kutta condition in the transformed plane gives: 


Das pa 1-1/k 
v= Tike) 608 (| yi (es) (5.1) 


In this case V is due both to the external imposed 
velocity field and to the previously shed point vortices 
representing the rest of the sheet. At the same time, for 
a non-Zero edge angle the vortex sheet grows to the length 
6s(t) = + y,(t - ta since ir is its convection velocity. 
If the circulation y,5s is now considered to be centred on 


a point on the extension of the wetted surface at a distance 
from the edge: 


x, = (1-1/k) és, (5.2) 


closer to the edge than the midpoint of the element, the 
Kutta condition is satisfied as if by a point vortex. The 
result is a prescription for shedding point vortices which 
satisfies the correct condition for a continuous separating 
sheet with: 


2k/(2k-1) 
A a m(k-1) 2il 
ie k 78s ae le sin | pao (5.3) 
k/ (2k-1) 
ER -i6/2 _ „-1/(2k-1) T(k-1) ~id/2 
ale 3 ose noe 2k sin = ae 
(5.4) 
k/(2k-1) 


L is the length scale for the wedge flow (=(V ôt) a 
The local velocity field is calculated assuming that the 


element adjacent to the edge is a sheet element but that all 
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succeeding elements are points. This gives satisfactory 
results for the surface velocity as the calculation of a 
uniformly accelerating flow round a 45° edge as shown in 
Figure 2. 


Clapworthy and Mangler (1974) have shown that the shape 
of a vortex sheet leaving a sharp edge conforms to a curve 


"sax" 2G + 
y 5 ahs 


where the local x' and y' axes are parallel and normal to the 
wetted surface. Figure 3 shows a plot of the vortex sheet 
shape adjacent to the edge computed by a number of different 
point vortex schemes including the 'cloud-in-cell' method 
discussed later. They all follow this shape closely after 
the first elements of the sheet. 


An altogether more accurate but more expensive method is 
to represent the whole of the sheet by piecewise continuous 
sheet elements. Basu and Hancock (1978) for an unsteady 
aerofoil flow and Faltinsen and Pettersen (1983) for a number 
of bluff sections used straight elements. The evolution of 
the sheet is computed in these cases from the velocities at 
the element midpoints reinterpolated to the node points. 
Faltinsen and Pettersen found that because of the large 
distortions induced in a rolling up sheet behind a bluff 
body it was necessary to continuously rediscretise the sheet 
to keep the element lengths in sensible proportion. With 
large rates of strain on the sheet even rediscretisation 
eventually becomes ineffective and the sheet must be severed. 
This operation leads to some problems in selection of general 
criteria for interfering with the sheet in this way and also 
for the selection of new core vortices representing new 
centres of roll-up. However the results are generally superior 
to point vortex representation. Higdon and Pozrikidis (1985) 
using circular arc elements to represent the sheet similarly 
obtained very good results. 


If the body surface is modelled by the surface vortex 
method rather than by transformation and images it becomes 
even more important to use (higher order) panels for sheet 
and wall elements in close proximity in order to avoid 
leakage. Ashurst (1979b) avoided this problem by using local 
images with far field correction by the surface singularity 


method. 
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6. SEPARATION FROM A CONTINUOUS SURFACE 


Unfortunately many practical bodies, not least the 
circular cylinder, have continuous curved surfaces without 
specific edges fixing separation. The prediction of 
separation for such cases cannot strictly be obtained from 
a straightforward boundary layer calculation but requires 
a local Navier-Stokes solution such as Smith's (1977) 
application of the triple-deck technique. Many discrete 
vortex methods avoid this by either specifying separation 
empirically, or using an approximate boundary-layer separation 
condition or replacing the viscous diffusion in the shear 
layers by a viscous vortex model. Stansby (1977) in some 
of his earlier work with a circular cylinder, fixed separation 
at 95° for supercritical flow. Katz (1981) and Basuki (1983) 
similarly fixed the forward separation point on stalled 
aerofoils. In the case of oscillatory incident flows past 
circular cylinders in which the separation points are known 
to move through large angles, and also in the case of 
secondary separation, Stansby (1981) has suggested specifying 
the separation points empirically at points where the velocity 
has dropped by 5% from its peak value. With a specified 
separation point, vortices are usually released from a point 
at a fixed height h above the separation point. This strategy 
is required so that the Kutta condition can be satisfied by 
the point vortex model at the separation point. The point 
vortex model does not however give a very good representation 
of the pressure field just ahead of separation and this is 
a drawback in its use with a separation prediction model. The 
true shape of the separating sheet should induce ahead of 
itself an adverse pressure gradient with a square-root 
singularity at the separation point. As with the sharp 
edge, this behaviour is generally better modelled by a sheet 
of piecewise continuous elements in the neighbourhood of 
separation. 


Deffenbaugh and Marshall (1976) and Sarpkaya and Shoaff 
(1979) and others have included a separation prediction based 
on laminar boundary layer theory in their discrete vortex 
models. This technique has been successful because the point 
vortex model of the sheet smeared out the singularity in the 
adverse pressure gradient sufficiently for convergence to 
a stable and realistic separation point. If however a more 
accurate representation of the sheet is used the square root 
singularity will tend to drive the separation point predicted 
by laminar boundary layer theory ahead of it until the smooth 
separation point far upstream of the true one is reached. 
Smith's application of triple deck theory allows a proper 
viscous-inviscid interaction to be carried out giving a 
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better prediction of separation for steady flows. Fiddes 
(1980) has also used it successfully to predict the laminar 
separation line on a slender cone at incidence. Faltinsen 
and Pettersen (see Aarsnés(1984))used it in a similar way, 
assuming it can be applied quasi-steadily at the separation 
points on a circular cylinder. The technique in the latter 
case involved estimating the strength of the pressure-gradient 
singularity from the numerical prediction of surface pressure. 
This was matched to a skin-friction parameter given by an 
upstream laminar boundary layer calculation. The actual 
separation point was then solved for iteratively. An average 
separation angle of about 75° was obtained for the first few 
periods of development of a Karman vortex street. Extension 
of the technique to turbulent boundary layers is not yet 
established. 


Stansby and Dixon (1983) following Chorin's (1978) 
work have as an alternative represented the laminar boundary 
layer on the cylinder by a distribution of discrete vortices 
with an added random displacement at each time-step to 
simulate viscous diffusion. Separation should then, in theory, 
occur naturally when vorticity of opposite sign to the majority 
of the wall vorticity is created at the wall faster than it 
can be absorbed by diffusion. They used this technique with 
point vortices to predict separation on circular cylinders 
in both steady and oscillatory streams. This procedure is 
correct in the limit of very large vortex number density, but 
its accuracy when relatively few vortices are used is not yet 
established. Millinazzo and Saffman (1977) have shown that 
the random walk method only represents viscous diffusion 
accurately when the vortex number density is of the order 
of the Reynolds number in magnitude. In practical calculations 
possible at present the number of vortices across the depth of 
the boundary layer is very few and therefore the velocity 
profile through the boundary layer is crude. However Stansby 
and Dixon obtain good predictions for some of the global 
parameters such as drag and Strouhal number. Spalart et al. 
(1983) have in a different way coupled a boundary layer 
calculation to an outer discrete vortex simulation. The 
boundary layer vorticity is replaced by discrete vortices 
ahead of the separation point (where boundary layer theory is 
still valid). These vortices, initially lying on the 
displacement surface of the boundary layer are shed into the 
outer flow region at (separation) points where a rapid 
increase in the normal flux of vorticity occurs. 


7. SECONDARY SEPARATION 


Secondary separation beneath the primary separated sheet 
is known to have a considerable influence on the behaviour of 
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vortex sheets (for example on a slender wing, Smith (1968)). 
It is observed to be present during starting flow past a 
circular cylinder. Both Stansby and Dixon (1982) and 
Faltinsen and Pettersen (Aarsnes) (1984) have incorporated it 
into their models for flow past circular cylinders and have 
observed that the release of opposite signed vorticity which 
weakens the primary sheet leads to a longer vortex formation 
region. Excessively short formation regions are a frequent, 
unrealistic, outcome of discrete vortex calculations also 
associated with overpredicted force coefficients. Significant 
opposite signed vorticity is only shed during a small part of 
the main shedding cycle when the latter is close to the 
surface and induces high reversed flow. It may be therefore 
that the roll up of the main shear layer is very sensitive 
to a weakening effect at that time. The prediction of the 
secondary separation points is difficult because of the 

large changes in the instantaneous velocity field in the 
separated region. For that reason the empirical 'velocity 
drop' rule is often resorted to. 


8. THE CLOUD-IN-CELL METHOD 


The fact that the number of operations required by the 
Biot-Savart integration increases as n? where No is the 


total number of vortex elements in the flow at a given time 
puts a strong constraint on the number of time-steps for which 
a developing flow can be computed. For this reason alternative 
less expensive techniques of computing the velocity field 
have been sought. If the vorticity distribution is assumed 
to have a finite support, each vortex element occupying a blob 
of space (Chorin (1973), Leonard (1980)) and therefore 
representing a diffused sheet it is possible to solve the 
Poisson equation (1.3) linking the stream-function to the 
vorticity field by differential methods. This can be done 
either by direct reduction methods or by fast Fourier 
transforms. However the resulting equations still involve 


2 
O(N.) operations per time-step, due to the irregular 


distribution of the vortex elements in space. Van der Vegt 
and Huijsmanns (1984) used a cubic spline interpolation for 
the Fourier functions which considerably increased the 
efficiency. This is a grid free process but it is not clear 
that the interpolation errors are of lower order than those 
arising from the use of an Eulerian grid. 


More usually, methods which calculate the velocity field 
directly from the Poisson equation do so by finite difference 
solution on a regular Eulerian grid. This is the basis of the 
hybrid Langrangian-Eulerian method known as the Cloud-in-cell or 


j 
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Vortex-in-cell method, (Christiansen (1973)). 


The usual procedure for a two-dimensional flow is as 
follows. The vortex sheets shed into the fluid are 
represented at time t by arrays of point vortices 


AT k=1...N, at position 2, (t) + The individual circulations 
ary of these vortices are distributed as vorticity Way over a 


regular mesh, usually by area weighting (bilinear distribution) 
to the four surrounding node points. The finite difference 
form of equation (1.3): 


2 2 


A 5 avis 
Stn +=- w (8.1) 
Ax Ay J 


may now be solved using a fast elliptic solver. A number of 
methods are available ranging from calculating the double 
finite Fourier transform of (8.1) in both x and y directions 
to the direct block elimination methods. 


The velocity field on the mesh wy! Wai is obtained by 


central differencing the stream-function on the mesh. The 
same area weighting as for the vorticity is then used to derive 
the velocity field at the vortex points themselves which is 
used to convect the vortices. 


This distribution method on a uniform rectangular mesh 
satisfies conservation of vorticity and of its two first 
moments Tx aT and Zy Aly + The velocity field induced by a 


point vortex on the mesh falls to zero at the position of the 
vortex so that no self-induced velocity is generated. Point 
vortices thus appear within the mesh as if spread over a 

finite core comparable to the local mesh size, but not 
perfectly isotropic and depending on the position in the cell. 
The numerical viscosity introduced by the mesh is non-diffusive, 
but has the effect of preventing instabilities of sub-mesh 
length-scale developing, without removing all previously 
created detail at this scale, see for example Baker (1979). 


As with the discrete vortex method either transformation 
or singularity methods may be used to represent a solid body 
in the mesh. In the former case we transform a single closed 
body into a circle which is surrounded by a concentric polar 


mesh. 
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1...N 


" 
i] 


The regular mesh 8, 2ni/N i 


1...M 


di 
"i 


r, =R(H8)I + j 


J 
where R is the radius of the circle has the advantages of 

a constant mesh aspect ratio and of concentrating mesh points 
next to the body. The polar form of equation (1.3), Fourier 
transformed with respect to 8, is solved in finite difference 
form: 


D 
=€ 
1 
N 
E 
+ 
=€ 
N 


K (8.2) 
Ein E ai 0 

ERT 

J 


where Bey and ae are Fourier transforms of vij and Wi5¢ 


Gaussian elimination is used for the resulting tridiagonal 
set of equations. 


As with all transformation based methods it is more 
efficient to convect the vortices in the circle (t) plane 
using a modified convection equation: 


2 2 2 

* 

ao" eal ET S ir a bk dtk dtk a 
ot ac 4n az az, az, 


The transformation modulus and Routh correction are calculated 
at the mesh nodes and prestored. Transformation usually 
produces a highly distorted mesh in the physical plane with 
small meshes in the separation regions which is likely to 

be advantageous. 


Alternatively the body may be overlayed by a regular, 
usually rectangular, mesh. The zero normal velocity boundary 
condition is then satisfied at N points on the body surface 
thus specifying Ny unknown singularities at the Np nođes of the 
grid which are interior to the body. The capacity matrix of 
influence functions relating these points to the boundary 
collocation points can be calculated at the start, inverted 
and stored. A variation of the method which offers improved 
accuracy for thin bodies, replaces the surface of the body by 
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a sheet of dipole panels, In two dimensions the method of 
Kennedy and Marsden (1976) can be used to obtain the velocity 
on the panels in order to satisfy a condition of constant 
streamfunction on the surface of the body. 


Use of Fourier transforms to solve the Poisson equation 
is consistent with a periodicity in the direction on the polar 
grid. For other grids there is some error or extra work to 
avoid the implied periodicity. There is generally a problem 
with the velocity field associated with any vortex lying 
within one mesh of a solid boundary. For that case the 
vortex and its image in the boundary have implied cores which 
overlap leading to considerable reduction in the calculated 
velocity. It is therefore necessary to include a short 
range correction to the velocity field particularly for 
vortices close to the separation point. 


9. FLOW PAST A NORMAL FLAT PLATE CALCULATED BY THE 
CLOUD-IN-CELL METHOD 


This flow has the advantages of fixed separation points 
and that transformation to a circle can be carried out 
analytically. If an impulsively started flow past a normal 
plate is allowed to develop naturally asymmetry grows very 
slowly from the level of the rounding errors. In order to 
obtain a Karman vortex street in a practical length of 
computation time some artificial asymmetric stimulus is 
required at the start. Examples are the addition of one 
strong vortex downstream of the plate (Naylor (1982)), 
weakening of one of the two separating vortex sheets, 
(Sarpkaya and Shoaff (1979)) or application of a temporarily 
asymmetric incident velocity (Graham and Naylor (1982), 
Faltinsen and Pettersen (1983)). In the calculation shown 
here, the shedding process releases a number n of unit 
strength vortices at each time step, where n is adjusted to 
the nearest integer satisfying the Kutta condition. This 
leads to some jitter in the sheet but gives a denser 
representation of the sheet. 


If the vortex shedding due to a steady stream is given 
an initial asymmetric perturbation, a Karman vortex street 
rapidly develops (Figure 4). The Strouhal number for this 
calculation is S=0.146 which compares reasonably with 
measured values. But the vortex formation region appears to 
be too short and the vortices in the street too concentrated 
and distinct compared with observations of high Reynolds 
number flows. It is usually reckoned that about 40% of the 
vorticity shed from the edges of a bluff body is lost in the 
near wake by mixing with vorticity of opposite sign (Fage and 
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Johansen (1927)). In the above flow simulation very little 
mixing occurs across the wake and there is no scope for other 
forms of cancellation. This in turn leads to drag forces on 
the body considerably higher than the mean value which is 
usually measured, However the simulation does compare well 
with the flow visualisation of a flat plate of low aspect-ratio 
between end-plates taken by Perry et al. (1982). Reduced 
aspect-ratio between end-plates is known to increase the 
spanwise correlation of the vortex shedding, giving approximate 
two-dimensionality when the aspect-ratio is less than about 4. 
Locking of the vortex shedding by oscillation of the body has 
a similar effect and results in a reduced formation region, 
increased drag and more concentrated vortices. 


Fig. 4. Cloud-in-cell computation of the vortex sheet behind 
a flat plate in steady flow. 


If, however, the vortex shedding is triggered 
two-dimensionally by the flow, as is the case for an in-line 
oscillation of the free stream, much better agreement with 
experimental measurement is obtained. Figure 5 shows the 
vortex pairs shed from a flat plate of width D during the 
first cycle of a reversing oscillatory flow ORS sin (21t/T) 


compared with flow visualisations taken in a similar case by 
Oshima and Oshima (1980). For longer periods of flow the 
predicted vortex shedding patterns are also in substantial 
agreement with observation for values of Keulegan-Carpenter 
numbers, Ko = RA T/D, up to about 30, The vortex shedding is 


generally asymmetric for Ko”? 5 and the resulting force 


history periodic. 
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Oshima & Oshima 
(1980) 


Flow around an 
impulsively atarted 
aerofoil, travelling 
normal to its chord 


Fig. 5. Symmetric vortex shedding from a flat plate in 
oscillatory flow, Ko =2. 


The predicted drag and inertia coefficients agree well 
with experimental measurements, Figure 6. Visualisation of 
these oscillatory flows does show that with some exceptions 
the vortex shedding is substantially correlated across long 
lengths of span. 


293 


294 GRAHAM 


| Computed « J 
° measured — Obasaju, 1981 


(0) IS des sts a ee er 
0 4 8 12 16 20 24 28 32 36 44 
Ke 


Fig. 6. Drag and Inertia coefficients for a flat plate in 
oscillatory flow as functions of Keulegan Carpenter 
number, 


10. VORTEX DECAY 


In order to obtain more realistic predictions for the 
forces and vortex strength in the wake, particularly for 
steady incident flows, an artificial vortex reduction or decay 
has frequently been incorporated into discrete vortex 
calculations, see eg. Sarpkaya and Shoaff (1979). Katz 
(1981) also, reduced the strength of vorticity shed from the 
leading edge separation point on a stalled aerofoil by an 
empirical factor at separation. A continuous decay factor, 
based on the diffusion of a viscous vortex, is very often 
applied. The circulation within a fixed radius Xo of the 


centre of such a vortex 'decays' with time as: 
2 
T(t) = rT (l - exp (-x/4vt)) (10.1) 


This apparent rate of decay is then applied to every point 
vortex, based on its residence time in the flow (eg. Kiya et al 
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(1982)). This technique may model some short range 
cancellation occurring through diffusion and mixing but 
generally violates conservation of circulation in the far 
field. Others, eg. Naylor (1982), have used simpler 
exponential decay laws. 


The effect of vortex decay of whatever sort, in the case 
of flow behind a bluff body is to lengthen the formation 
region a little and reduce the mean drag coefficient 
considerably, For example in Naylor's case the mean drag 
coefficient of the normal flat plate was reduced from above 
3.0 to 2.3. On the other hand when used in the simulation of 
a reattaching shear layer, the bubble length is reduced by 
vortex decay, see Kiya et al (1982). This is in line with 
the observed effects of decreasing Reynolds number on flow 
down a step and here the vortex decay is probably mimicking 
the reduction associated with production of negative vorticity 
on the wall beneath the bubble. Without vortex decay there is 
some doubt as to whether the asymptotic bubble length is 
finite, 


ll. THREE DIMENSIONAL EFFECTS 


Since the wake behind many nominally two-dimensional 
bluff bodies is known to be instantaneously three-dimensional, 
it has been suggested that the observed vortex reduction may 
be associated with three-dimensional perturbation of initially 
spanwise vortex lines. This does not reduce the overall 
amount of circulation unless like diffusion it leads to mixing 
of vorticity of opposite sign. But it does change the 
velocity near field induced by a vortex line and it is 
possible that numerical vortex decay simulates this effect. 


A numerical experiment is described in Graham and Naylor 
(1982) to see the effect on a discrete vortex calculation of 
a small amount of sinusoidal, spanwise waviness. The results 
showed a significant effect on the vortex formation region 
and a corresponding reduction in the drag coefficient. 
Three-dimensional effects may also be important in modelling 
the reattachment of a shear layer. Kiya et al (1982), for 
example, found that the mean velocity profile, 'turbulence' 
intensity and shear stress are badly predicted in the 
reattachment region by two-dimensional calculations. Faltinsen 
and Pettersen (Aarsnes (1984)) and Stansby and Dixon (1982) 
however have succeeded in obtaining quite long vortex 
formation regions in the base of a circular cylinder when 
secondary separation alone was incorporated. 
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12. APPLICATION TO SEPARATED FLOWS ON AEROFOILS 


The Cloud-in-Cell method may be used to calculate a wide 
range of bluff body flows by making use of numerical, if 
necessary, transformation techniques. Another important area 
of application is to unsteady and separated flow over aerofoils. 
Katz (1981) has applied the discrete vortex method to 
impulsively started separated flow past an aerofoil. Spalart 
et al (1983) have applied the same method to this problem 
together with a boundary layer representation and Lewis and 
Porthouse (1983) used the discrete vortex method with a 
random walk method of predicting separation similar to 
Stansby and Dixon's (1983). Kats used a vortex reduction 
factor for the sheet separating at the leading edge and Basuki 
(1983) using the Cloud-in-cell method, found similarly that 
the upper surface pressure distribution was very sensitive 
to the strength of the separated sheet above it. He found 
that without any form of vortex decay the strength of this 
shed sheet caused the sheet coming off the trailing edge to 
roll up very strongly above the trailing edge and induce 
strong suction pressures in that region. 


Instantaneous smoke trace photograph Mean smoke trace photograph 
Re Re. No.= 10° 
Uœ 


Fig. 7. Vortex wake from separated flow over 12% thick 
Joukowski aerofoil at 30° incidence. 


However the use of a similar rate of decay to that used by 
Kiya et al (1982) gave a more realistic wake development 
(Figure 7) and mean pressure distribution (Figure 8). The 
need to introduce this artificial aid may be associated with 
the difficulty of modelling a separated sheet in close 
proximity to a solid surface. 
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Fig. 8. Mean separated flow pressure distribution on the 
surface of a 12% thick Joukowski aerofoil at 
30° incidence. 


13. CONCLUSIONS 


This paper has concentrated on the use of the discrete 
vortex method to model vortex sheets shed from two-dimensional 
bluff bodies. The method clearly also has important 
applications in areas of three dimensional wake modelling 
(as behind road vehicles for example) and for compressible 
flows. 


Discrete vortex calculations of separated flow have 
perhaps not yet reached the status of conventional Eulerian 
mesh solutions of the Navier-Stokes or compressible Euler 
equations for routine purposes. Flow simulations by means 
of this general method are however becoming increasingly 
reliable and therefore may offer more hope in the short term 
of an accurate method of simulating high Reynolds number 
flows in which large scale separations and large eddy 
structures dominate. The main problem areas, from a 
practical point of view, seem to be the prediction of 
separation points for the shedding of vortex sheets from 
continuous surfaces and the resolution of the problem of 
apparently excessive sheet strength. The latter points 
to the importance of modelling secondary separation and 
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three-dimensional effects as well as diffusion. 
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l. INTRODUCTION 


The inspiration for the work presented here is a tantalizing 
paper, "A multiple grid scheme for solving the Euler equations", 
given at a conference by Ni (1981). Ni showed results that 
seemed very impressive in accuracy as well as speed, but he 
gave only an indication of how they were obtained. Such speed 
and accuracy might enable three-dimensional solutions of the 
Euler equations to be a matter of routine in practical 
aerodynamic design. However, other researchers have had 
difficulties in attempting to reproduce Ni's speed. For 
example, there were four papers given, on various extensions to 
Ni's scheme, at the corresponding conference in 1983, and in 
only one of these, by Ni's fellow-researcher Davis (1983), was 
the speed of Ni's scheme reproduced. Also presented in 1981, 
at the same conference, was a paper (Jameson et al, 1981) on 
numerical (non-multigrid) solution of the Euler equations using 
multi-stage time-stepping schemes of Runge-Kutta type. 
Subsequently, Jameson (1983) presented a multigrid method of 
his own that incorporated such multi-stage time-stepping. The 
schemes to be described here are developments suggested by the 
work of Ni and Jameson. The mathematical treatment is 
restricted to flow in two space dimensions, and the illustrative 
numerical results are for aerofoils, but the schemes are 
essentially applicable to a wide tange of configurations. 


A basic feature of the present schemes is the form of the 
approximation for the flux balance for a computational cell, 
where the values of the dependent variables are specified at 
cell vertices rather than at cell centres or as cell averages. 
In this we choose to follow Ni (and Denton, 1981) rather than 
other researchers who, like Jameson, adopt cell-average 
approximations. To reflect the roles of cell and vertex we 
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call the schemes 'cell-vertex' schemes. The advantage of cell- 
vertex schemes is that second-order spatial accuracy can, in 
principle, be obtained for the flux balance, even at boundaries 
and with non-smooth grids. 


However, to achieve such accuracy new developments, beyond 
the details given by Ni (1981), have been required. For 
practical shapes the grid will typically be far from uniform, 
with cells near the leading edge having dimensions smaller than 
the leading-edge radius and cell dimensions in the far field of 
the order of the wing chord. It is found that when the scheme 
suggested by Ni is applied to such highly stretched grids heavy 
smoothing is needed to ensure numerical stability, and a 
substantial loss of accuracy results. This defect is remedied 
here by ensuring that mass and momentum changes are conserved 
in a key step of the numerical process. Another development 
has been required for the treatment of the flow-tangency 
boundary condition at the body surface - a topic that is not 
discussed by Ni. Here second-order accuracy is retained by a 
simple modification of the numerical scheme that involves an 
image principle but does not involve any extrapolation. 


In addition to a cell-vertex formulation Ni (1981) adopted 
a time-stepping algorithm of Lax-Wendroff type for advancing to 
the steady state. The process was accelerated by the use of 
multiple grids, but Ni did not provide enough information, 
either in guiding principle or in detail, to enable others to 
reconstruct his multigrid time-stepping algorithm without 
considerable improvisation. For the present schemes some 
principles for multigrid acceleration have been formulated and 
possibilities for time-stepping in a multigrid framework have 
been explored. It is found helpful to take a viewpoint 
suggested by the uniqueness theorem for Cauchy's initial value 
problem for hyperbolic equations. Thus, to reduce the number 
of grid points involved towards the Cauchy minimum we march in 
cycles of steps, with each cycle consisting of successively 
larger steps associated with successively coarser grids; in 
effect, the time-step is greatly extended at small cost. To 
ensure convergence to the fine-grid solution the problem is 
formulated so that coarse-grid changes vanish when corresponding 
changes in the previous step vanish. To advance the solution 
efficiently boundary quantities are updated on the coarser 
grids as well as on the first, finest, grid of a cycle. 
Finally, for further economy in computing time, the calculation 
begins with a reduced number of steps or grid-levels per cycle 
and additional, finer, grids are added as the calculation 
proceeds. It is found that when the solution is advanced by 
Lax-Wendroff time-stepping in a multigrid framework constructed 
on these principles the rate of convergence is indeed greatly 
improved. Yet the number of multigrid cycles required for a 
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given level of convergence remains significantly higher than 
the number required in the multigrid method of Jameson (1983). 


In contrast to the above, Jameson adopts a time-stepping 
algorithm of the multi-stage Runge-Kutta type. This admits the 
use of time steps much larger than those admitted by a Lax- 
Wendroff algorithm and would be expected to give a reduced number 
of multigrid cycles for convergence. Jameson's use of residual 
averaging and enthalpy damping as accelerative devices would 
contribute further, but it seemed worthwile at this stage to 
explore the advantages offered solely by such multi-stage* 
time-stepping. Jameson's multigrid framework seems to be very 
similar in practice to that outlined above, although it may be 
viewed as a member of the class of multigrid schemes advocated 
by Brandt (1977) and conventionally associated with elliptic 
equations and iteration processes for boundary-value problems. 
Thus we wish to combine multi-stage time-stepping with the 
established cell-vertex formulation and multigrid framework 
and then to compare the overall performance of the scheme with 
that of the corresponding Lax-Wendroff scheme. The significant 
factors for comparison are the computing time required for 
convergence (not the number of cycles), and the robustness and 
ease of use of the scheme. 


The task would have been relatively straightforward if we 
simply adopted Jameson's multi-stage algorithm as it stood, 
because the choice of time-stepping algorithm is independent 
of the form of the approximation for the flux balance. However, 
one feature of Jameson's algorithm is the inclusion of a fourth- 
order spatial dissipation as a stabilising device. The flux 
balance for a particular cell thus involves cells beyond the 
immediate neighbours, which complicates the scheme considerably, 
especially when applied to complex configurations with adjoining 
grids. A further objective, therefore, is to find a stabilizing 
device that is free from this complication. The device we 
adopt, after some trial, is essentially a second-order time- 
change rather than a spatial dissipation. It turns out, when 
the performances of various multigrid schemes are compared, 
that the computing times for multi-stage time-stepping are no 
less than those for Lax-Wendrotf time-stepping, even though far 
fewer cycles are required for convergence with the former. It 
is found, however, that a composite Lax-Wendroff and multi- 
stage scheme gives faster convergence, with significantly less 
computing time, than either Lax-Wendroff or multi-stage time- 
stepping on its own. 


e e a aaea A 
* For brevity, the term 'multi-stage' is taken to mean "multi- 
stage of Runge-Kutta type', so that a multi-stage scheme is 

distinct from a Lax-Wendroff scheme here. 
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This paper begins with derivations of the cell-vertex 
schemes for Lax-Wendroff and multi-stage time-stepping without 
multiple grids. The treatment of the flow-tangency boundary 
condition, and of stability and smoothing, are discussed in 
turn. Then the multigrid acceleration schemes, for Lax- 
Wendroff time-stepping and for composite Lax-Wendroff and 
multi-stage time-stepping, are described. Finally, numerical 
results are shown for the RAE 2822 and NACA 0012 aerofoils to 
demonstrate accuracy and speed. Surface distributions of Mach 
number, and convergence histories for lift, drag and a typical 
residual are shown. Comparisons with results from other 
methods are included. 


2. CELL-VERTEX FORMULATIONS 
2.1 Basic Euler equations 


Since we are seeking steady-state solutions of the Euler 
equations it is sufficient to consider the equations 


au _ (Œ , 26 
ry (= k a ' (2.1) 


where U, F and G are three-component vectors given by 


US fop rF =]pu r G= lev P 
2 
pu pu +p pvu 
pv puv ase +p 


and p is given by the Bernoulli equation 


eof (igt uts A). 


These equations describe a model flow in which the total 
enthalpy is constant, Bernoulli's equation having been 
substituted for Euler's energy equation and the equation of 
state. The model flow varies with a time t in a rectangular 
Cartesian space x,y. The quantities u and v are the velocity 
components in the x and y directions respectively, and p and p 
denote the density and pressure. All quantities are normalized, 
with a typical body dimension, the speed of sound at the free- 
stream Stagnation condition and the free~stream total pressure 
respectively taken as the units of length, speed and pressure. 
To derive the required cell-vertex schemes from (2.1) an 
arbitrary curvilinear grid defined only by the coordinates x, y 
of its grid points in Cartesian space is adopted. No f 
transformation of independent variables is made and P, u and v 
are retained as dependent variables, 
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2.2 Lax-Wendroff scheme 


The Taylor-series expansion for the component U in (2.1) can 
be written 


auy u ? 
ôU = () At + E s + higher-order terms , 


where the superscript n denotes the value of a variable at the 


P = +. = 
time level n, ôU = o Tis uw" and At Sem - Ees Substitution 
from (2.1) and change in order of differentiation yields 


n 2 
ape a , 38) ae - (2 (Ew +2 (aye. 
ox dy dx | 3U ot əy \ouU dt/} 2 (2.2) 


We denote the first order change in U and AU, that is, 


iiy Z fare, ac) 
AU = - & + a At. (2.3) 


Fig. 1 Geometry of cell-vertex scheme for single grid 


Now consider the curvilinear grid sketched in Fig. 1. The 
points marked A, B, C and D' are at the centres of the four 
cells shown and the letters also serve to identify the cells. 
The vertices of the cell C are marked 1, 2, 3 and 4. Then, 
given F and G in equation (2.3) at 1, 2, 3 and 4, the average 
first-order change in U for the whole cell C in the time-step 
At is, by the integral theorem of Gauss and the trapezoid rule, 


AU, = (e; = Fyi ya) + (Fy - F,) (y3 7 Y,) 


At 


ESE, E7 G3) (x, = x,) + (G, - G,) (x, -~ x) 20a, 1 (2.4) 


where AAG is the area of the cell C and the values of F and G 
are specified at the time-level n. Note that equation (2.4) 
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is an expression of the flux balance for the cell C. There are 
similar expressions for the first-order changes AU, Au, and 


Au, in the cells A, B and D respectively. 


Now it seems that in Ni's scheme (1981), the first-order 
change at the grid point l is taken to be 


Į 
au, =F (Au, + 4u,, + AU, + 4u,)- (2.5) 


It is found that use of the approximation (2.5) for aerofoils, 
with typical highly-stretched grids, tends to make the 
numerical solution divergent - the change in U from one time 
step to the next increases unless large amounts of numerical 
smoothing are applied. Here we adopt a more general formula: 
instead of (2.5), the first-order change in U in a time-step At 
at the grid point 1 is taken to be 


du, . AA, + AUL . + Au. . AA, +AU 
peices A Pigs € c D BEY ace) 


1 
da, + OA, + AA, + OA, 


Note that (2. . = = = 
ote that (2.6) reduces to (2.5) when da, 4A, 4A, Aap’ 


that is, when the grid is uniform. However the approximation 
(2.6) is regarded not a a formula for averaging but rather as 


1 

an aid to numerical co: >. er -> ii 

nvergence he product Au, z7OA, +AA 
AAG + 4a.) is the change in mass or momentum for an element of 

SR 

area (ha, ay 4a, + 4a, + 4a) , and Au, is the average change in 
U over this element. Thus the approximation (2.6) relates Au, 
to AU,, AU,, etc by conserving th . a 
i a’ B’ y g e changes AU, Z 4a, é Ug . 


7A, 1 etc. 


To estimate the second-order change in U we note first that, 
from equation (2.1) and the definition (2.3), we can replace 


(au/at)™ in equation (2.2) by AU/At. Denote (8F/3U) AU and 
(dG/3U) AU in (2.2) by AF and AG respectively and assume that 
their values at the points A, B, C and D are given by 
approximations of the form 


oF 1 E oF oF 3 

= Ap == =) + () + È + OF A 

ð 4 x P)o . 
E c ep 1 Be 2 Hi 3 ep 4 
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Then, for the element of area OA, with vertices at A, B, C and 


D an expression for the second-order part of the change in U 
is obtained by integration; it is similar to the right-hand 
side of (4) with F and G replaced by AF and AG, respectively. 
It is assumed that AA) = > (aa, + OA, + A, + 4a) approximately, 
so that the corresponding first-order change is given by (216) i. 
Hence the total change for the element of area AA), which we 


take to be the total change at the grid point 1l, is 


ôu, = AU, + (ur, - AF) (Yp T Yp) + (AF, ~ AFQ) (Ye T Ya? 


At 
+ (AG, - AG.) (x, = Xp? + (AG, - AGa? (x, - x,)) 7AA, 5 


The above is a two-stage scheme: 


(i) flow conditions at the vertices of each cell are used to 
calculate the first-order changes within the cell - equation 
(2.4), and 


(ii) the first-order changes in four neighbouring cells are 
used to calculate the total changes at the common, central, 
vertex - equations (2.6) and (2.7). 


Although (2.6) differs significantly from the corresponding 
equation of Ni, we have followed Ni in adopting a scheme with 
the dependent variables specified at cell vertices. One 
advantage over cell-average finite-volume schemes such as that 
of Jameson et al (1981) is that the resulting steady-state 
solution is formally second-order accurate spatially even for 
non-uniform grids that are not smooth. This advantage may be 
obscured by the fact that equations (2.6) and (2.7) are not 
second-order accurate for arbitrary grids. However it is 
observed in numerical calculations that as the steady state is 
approached, that is when ŝu + O, the changes given by equation 
(2.4) behave likewise, that is AU > O. This would be expected 
from the form of the right-hand side of equation (2.7), for 

AF = AG = O if all AU = 0. Equation (2.4) expresses the flux 
balance for a single cell. The only approximation made in 
deriving (2.4) is the trapezoid rule, which has an error of 
third order. The flux balance is, therefore, correct to second 
order. In contrast it is assumed in typical cell-average 
schemes that the conditions on a cell boundary required for the 
balance of fluxes are given by arithmetic means of conditions 
in the pair of adjacent cells. Any such mean is second-order 
accurate only for uniform or suitably smooth grids. In 
practical problems it is difficult to satisfy this condition, 
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especially when treating the more complex configurations, and 
the advantage of the present scheme could be marked. However, 
the overall advantages and disadvantages of the two types of 
scheme have apparently not yet been assessed systematically, 

It is possible that cell-vertex schemes may require more labour 
for implementation. 


2.3 Multi-stage scheme 


In this section we consider 4-stage schemes of Runge-Kutta 
type. Other schemes, with three stages, have also been 
considered, but these were found to be less effective and they 
are not discussed here. If a quantity U satisfies a 
differential equation 

au 
mei R(U) = 0, (2.8) 
n 
and we have U = U at time t = t, then to advance the solution 
by a time-step våt, where v is the Courant number, we may 
follow Jameson et al (1981) and set 


uO) En ] 
Rae Numa oD) va, ber © 
am Oat ent) 
2 (2.9) 
uv) = yp) = va. dtr?) 
uv) = y ~ var) 
n+l (4) 


where ajr Oy and a} are constants. Here we let U be the 
solution U, at the vertex 1, and R(U) be the residual [R(v)], 


in equation (2.1) at that vertex, expressed in some discrete 
form. 


This scheme has been implemented in several ways in attempts 
to calculate the flow past a lifting aerofoil with Courant 
numbers as large as possible. A key question has been what to 
choose for the form of the residual [R(u)] r The obvious 


choice 


AU 1 


De 
where Au, is given by equations (2.4) and (2.6), resulted in 


[Rw], =- 
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instability for all values of the constants oye a2 and a3 tried, 


unless the Courant number was reduced below unity. Presumably 
the process could be stabilized by the introduction of a fourth- 
order dissipation as Jameson et al (1981) have done. However, 
an alternative that is free from the complication of fourth- 
order dissipation has been found in the second-order part of 
the total change ôu, in equation (2.7). Equation (2.7) can be 


written 
ôu, = Au, + Atl AR(AF,AG)] , r 


where [AR(4F,AG)], = (we, - AFA) (Yp T Yp) + (AFD T AFR) (Yo Ya) 


+ (ac, - AG.) (x, - x) + (AG, - 4c.) (x, - 2) /(4ba,) A 
(2.10) 


Note that when AU > O we have AR > O. One suitable choice of 


residual is 
Au, 
ERO aie peer SAR (2.11) 


where ky is a constant. Rather than calculate AR at every 


stage in a time-step it has been found sufficient to freeze it 
at the value calculated for the first stage. Thus a viable 
scheme consists of the stages 


JO -u 

EREA (0) o) 
U =U T van (u + k, AtAR 
uí? = yp! + van (0 + xatar) 


(2.12) 
(3) _ ,,(0) (2) (0) 
U =U $ va, (av + k, AtAR ) 


vl = yO 4 v(s0 + katar) 


Another, equally successful, device is to add the correction AR 
only at the end of the step. Then, instead of (2.12), we have 
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we moe 
yi?) = u® $ va, au‘? 
0) = 0 + ya a ae 
yu) = y) a va au?) 
re REC Nee) 
ott =o!) a katar ® 
2 i 


where k, is another constant. ‘the above schemes have been used 
to advance to the steady state both without and with the use 

of multiple grids. By suitable choices of the constants the 
local Courant number can be made greater than 2 before the 
onset of instability. 


2.4 Flow-tangency boundary condition 


The condition that the flow be tangential to the body 
surface is implemented here by a modification of the scheme 
used for the interior of the field and involves an image 
principle. Fig. 2 shows the body surface passing through a cell 
vertex 1, with the cells B and C in the interior of the field 
and the slope of the surface at 1 given by the angle a between 
the local tangent and the x-axis. Given values of U at cell 
vertices along the body surface and within the field at some 


time, it is required to determine the total change Su, at 1 in 


the next time step, subject to the tangency condition 


v 
atia tana. (2.14) 


Fig. 2 Geometry of cell-vertex scheme at aerofoil surface 
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Firstly, a 'non-reflective' or 'incident' change is calculated 
without reference to the tangency condition or the image 
principle, the procedure being as follows for the Lax-Wendroff 
cell-vertex scheme: 

(i) The first order changes Au, and AU, for the cells B and 


C are obtained from equation (2.4). 


(ii) The 'non-reflective' or 'incident' total change at l, Buy, 
is obtained from equations (2.6) and (2.7), with AU, and Au, 
taken to be zero but with extrapolated values for AA, and 4a): 


The extension for multi-stage cell-vertex schemes is 
straightforward. For the scheme described by equations (2.12), 
for example, we have su; = Ge - Ul where ey is given by 
(2.12) with AU, given by equations (2.4) and (2.6) and AR, 
given by (2.10); while AU, and Au, are again taken to be 


zero. 
We then take, for the 'predicted' change at 1, 
P = o6yt 
ul 28u) 5 (2.15) 


Finally, the predicted changes in p, pu and pv are "corrected'’ 
by setting 
i p 
öp, = Pp, 
* 
(pu) = (ea) y cos a - (ou), (2.16) 


(oa,); sin a - (ov), + 


il} 


6 (pv) 1 


a P P 

where a) = (ew, + 5 (ow?) cos a + (om, + sov?) sin a. 
Equations (16) imply that the corrected change in eas where 
a, is the velocity component tangential to the surface, is 
equal to the predicted value, and that the change in Paar where 


as is the normal velocity component, is cancelled by its image. 
Note that the choice of formulae, in particular the choice 

for obtaining the above predicted quantities, has been 

decided on essentially by trial. Formulae based on the theory 

of characteristics were tried and discarded. Physical notions 

have provided some guidance but the over-riding criterion has 
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been rapidity and ease of convergence to the state where the 
Euler equations and the boundary conditions are satisfied 
simultaneously. Thus in the steady state, when Au, = AU, =O, 


the flux balance for each cell bounded by the surface will be 
correct to second-order, and the tangency condition will be 
satisfied exactly at the cell vertices on the surface. No 
extrapolation is involved in the flux balance; the only 
approximation is the trapezoid rule. It is only in the advance 
to the converged state that extrapolated values for aA, and 


4A, are required, and the choice is not critical. Since 


second-order accuracy is obtained for arbitrary grids, both 
non-smooth and smooth, there again appears to be an advantage 
over cell-average schemes. 


2.5 Numerical stability and smoothing 


The onset of numerical instability places an upper limit on 
the step sizes At that can be used in advancing to the steady 
state. The permissible maximum varies from cell to cell, 
depending on the geometry of the cell and the local flow 
conditions. For the Lax-Wendroff scheme the appropriate 
condition for numerical stability for the cell C (Fig. l) is 
taken to be 


OA AA 


At < min 7 D i £ : 
Judy” - vax" | + ade Jusy™ - vax" | + adm 
(2.17) 
where 

2 m 
T (Xo +x,- x) - x4) AINS C aaa. (x, + 5.5%, 7 x,) ' 
ny = + - - a = =- - 

Wings Wot va Squaw gn AYAY tivo Yi Y3) + 


Ag = (ax)? + ayt) Ò” $ hae (ax)? + (ay)? 


, 


and a = (p/o) *. The condition (2.17) seems satisfactory for 
present purposes although, strictly, it applies not to our 

basic equations (2.1) but to the full Euler equations which 
include an energy equation and in which variations in time are 
correctly represented. For the three-component system (2.1) the 
condition corresponding to (2.17) is much more complicated: 

the speed of propagation of small disturbances differs from that 
for the full Euler equations. Fortunately the difference is 
small at moderate Mach numbers. To evaluate AU, from equation 
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(2.4) we set (At), = At where At ax is given by (2.17). For 


the evaluation of SU, from equation (2.7) we set 
(at), = min (ae) ar (AED pr (AED gr (AE) p). (2.18) 


For the multi-stage schemes At is taken to be a multiple of the 
value given by equation (2.18). This multiple, the Courant 
number v, is kept fixed over the entire field. 


Numerical smoothing is needed, to keep spatial oscillations 
within acceptable bounds. Several forms of smoothing scheme 
are feasible. The scheme adopted here is derived from numerical 
experiments and is an elaboration of that suggested by Ni 


(1981). The smoothing change RA applied at the vertex 1 
(Fig. 1) is 


S = U U U U - 
su, = WG, +O, + 0, + EE ANDY (2.19) 


where U, = (0, + Uy +U, + Ug)» 


and so on, and the smoothing coefficient u is given by 
u= ybs + £(p,R),; (2.20) 


where Yo is a constant, As is an average cell dimension, and 


R is an average residual given by 


tel fl 
ponte IO A bere oo Ce Pe 
atl atl, [atl 


R= 


The term f (P:R) is introduced to provide strong smoothing in 
the vicinity of a shock but much less elsewhere, and has a form 


such that f > O when Ap > O. 


The change (2.19) is implemented to yield new smoothed 
values before calculating the changes given in sections 2.2 to 
2.4. For smooth grids the flux balance for a cell remains 
second-order accurate in the steady state, except of course in 
the neighbourhood of a shock wave. For non-smooth grids 
second-order accuracy can be maintained by setting Uo ~ As. 


3. MULTIGRID SCHEMES 
3.1 Principles for multigrid acceleration 


When a multigrid scheme is adopted in the iterative solution 


of elliptic equations the role of the calculations on the 
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coarser grids of a cycle is to damp errors of wavelengths of 
the order of the dimensions of the grid elements or cells. 
Since the iterative process can be interpreted as a marching 
process it seems plausible that the advance to the steady state 
by solution of hyperbolic equations can also be accelerated by 
the use of multigrids, with the same role for the coarse-grid 
calculations. This seems to be the view taken by Jameson 
(1983), who has demonstrated the marked power of multigrid 
acceleration in hyperbolic systems. (Ni (1981), also 
demonstrated such a power but he gave little indication of his 
guiding principles.) In the course of developing the present 
multigrid schemes it has been helpful to take a different view. 


PRS NS Seeger 


Fig. 3 Multigrid solution of Cauchy's initial-value problem 
for f(x,t) = 0. 


We begin by recalling the solution of Cauchy's initial-value 
problem for the simple hyperbolic equation f(x,t) = 0. Fig 3 
is a sketch of a grid in the x,t plane showing a line t = ty on 


which initial conditions are supposedly given, a point P at 
the time tor at which the solution is required, and the pair of 


characteristics through P which, together with t = tor define 


the domain of dependence of P. The uniqueness theorem states 
that conditions at P depend only on conditions on the segment 
of t= to that is bounded by the characteristics through P. 


Let us suppose that the time interval tp - to is much larger 


than the interval At given by the stability condition for the 
Spatial interval Ax. We could, of course, advance from t_ to 
fe) 


tp to determine conditions at P by one of the established 


explicit or implicit methods. In such an explicit method we 
would take several steps At in each of which we would calculate 
the conditions at grid points Ax apart on a line t = const. 
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For each grid point the calculation would be simple but there 
would be a large number of calculations. In an implicit method 
we might take only one large step but the calculation of 
conditions at P would be more complex, involving the 
simultaneous solution of a set of equations for conditions at 
the other points along t = tp: But we can observe, in the 


light of the uniqueness theorem, that both types of method 
would be unnecessarily laborious: the conditions at P are 
completely determined by conditions at the grid points from Q 


to R'on-t = tor and it is not theoretically necessary to deal 


with grid points lying between to and er or with points other 
than P itself on Yas Thus the multigrid scheme can be regarded 


as a particular way of reducing the effort required to calculate 
the solution at P, in which the number of grid points involved is 
reduced toward the minimum required by the uniqueness theorem. 

A plausible scheme might involve an advance from to to to in 


successively larger time-steps that are associated with 

successively coarser grids, as indicated on the Figure. In an 
advance to the steady state we would repeat time-step sequences 
of the type shown in Fig. 3. Thus we would advance from to to 


t_, interpolate between the widely-spaced calculated results at 
t to obtain a new set of initial conditions specified at 
closely-spaced intervals along t= to and advance beyond to in 


the same way as we advanced from to to Soi For the process to 


succeed we require that calculations on a coarse grid 


(i) preserve the initially specified information that is needed 
to determine the solution at P, and 


(ii) advance the solution efficiently towards the required 
state. 


Some guiding principles for the construction of a multigrid 
scheme follow. To preserve information from a previous, finer, 
grid a coarse-grid calculation should itself satisfy the Euler 
equations and be formulated so that changes 6U vanish when 
corresponding changes in the previous step on the finer grid 
vanish. Thus in interpolating coarse-grid results, only changes 
should be interpolated, not the dependent variables themselves. 
To advance the solution efficiently boundary quantities should 
be updated on the coarse grids as well as on the first, finest, 
grid of a cycle. Note that if a coarse-grid calculation were 
intended only to damp errors of the appropriate wavelength it 
would be natural to set instead ŝU = O on the boundaries; this 
is found greatly to reduce the rate of convergence to the steady 
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state. Note also that while implicit algorithms may be 
considered in the search for computational efficiency, such 
algorithms do not readily fit the view we are taking of the 
multigrid process; in a multigrid framework explicit 
algorithms may be as efficient as implicit ones. Only explicit 
algorithms are considered here. 


3.2 A multigrid scheme with Lax-Wendroff time-stepping 


To advance the solution on a coarse grid in the required 
direction the coarse-grid changes could be obtained from 
equations (2.6) and (2.7) but, if the first-order changes 
corresponding to AUas AU: etc. are required for (2.6) were 


evaluated directly from (2.4) on the coarse grid, the results 
would be unacceptably inaccurate and, moreover, the changes on 
the coarse grid would not vanish as the changes on the previous, 
finer, grid vanish. Ni (1981) proposes that the first-order 
changes required for (2.6) be replaced by quantities derived 
from the overall changes on the previous grid but he does not 
elaborate further. The present scheme is a development of Ni's 
proposal that follows the guidelines of the previous section. 


Suppose that the coarse-grid calculation under consideration 


+ 
involves an advance At™ Da RN - t°. The Taylor-series 


expansion for the corresponding change in U can be written 


l n+l av \" +1 a (a ieee 4 
sut! =y + = ea) a ug, w) fat) 
(8u) = u t At Hoe (oe 7 Fiss, s 


while for the previous time-step, on the next finer grid, there 
is another expansion 


i n n n, 2 
oe eet. (Yur (2 s EKRA 


at \ot 2 


, 


so that elimination of (au/at)™ yields 


1 nat 1 (a fa 
(6u) 9* = (6U i (2 (au n+1,2 n, n+l 
(60) oT +5 oe lee (At 7)" + At At ` 


Substitution from equations (2.1 
differentiation yields 


n+l 
aott = (gy)? AE a [a Ea ia: caai is 
ace 2 (əx bu dt/ ~ dy QU dt 


(or? + setae) $ 


) and change in the order of 
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and substitution of 
au _ (6u)? 
YS mye: + O(At) 
At 
finally yields 


n+l j n n+l 
aual = (gu)? E -h E sv) + @& su) a x 
at” x Y 


( attt 


PAEA sais (3.1) 


Fig. 4 Geometry of finite-volume scheme for multiple grids 


Fig. 4 shows a coarse grid, with the cell vertices marked by 
crosses, together with the finer grid used for the previous 

time step. The former is obtained by omitting every second line 
of the latter. The total change at the vertex 1 on the coarse 
grid is taken to be the change for the element 3579. The first- 
order part of the change Av, is an average of the values of 


(su) (att Aath) over the element, namely 


n n n n 
ayn z (SU) y+ AA, + (80) 5 4a,+ (80) ¢° AAG + (8U) 5+ Aa, antl 
i Aan + Oa, + 4a, + 4A) at” 


where (605 =< (ao? + (607 + (603 + au’). 


Note that this average conserves changes in mass and momentum. 


The second-order part of the change is obtained from equation 
(3.1) as before, by application of the integral theorem of 
Gauss and the trapezoid rule. Again, for convenience, let 
AF= (3F/3U) ôU and AG=(3G/3U) SU. The final result for the 
total change at 1 is 


320 HALL 


EE ntl ii 1G z= - 
aut = aut + (ue, AF.) (Yg Ys) + (AF, Fg) (Y3 Y7) 


n 


n+l / n+l 
MPH (cia Ae 
+ (AG, = AG,) (xg = x) + (AGg = AG.) (x, - x?) E ( TAA, l 


(3.3) 


where OA, is the area of the element 3579. 


Equations (3.2) and (3.3) are evaluated on all but the 
finest grid of each cycle. It can be seen that if (8u)" + o 


then (u)? * 40, which is a necessary feature if the scheme is 
to be convergent. 

The boundary conditions are implemented on each coarse grid 
as well as on the finest. The procedure is straightforward and 
follows that given in section 2.4 except that, instead of 
evaluating the first-order changes from equation (2.4), we take 
averages given by (3.2) and, instead of solving (2.7) we solve 
(3.3). 


The time steps for a coarse grid are specified by making use 
of the time steps already calculated for the finest grid of the 
cycle, In the first coarse grid in the cycle it can be 
deduced, from inspection of Fig. 4 together with equations (3.2) 

+ 
and (3.3), that (at,)" lz (at,)". For subsequent coarse grids 


we take (Fig. 4) 
n+l _ n n n 
(ae) = 2. min (at), (ates (At) Cr a), (3.4) 


where A, B, C and D are points on the grid that is two levels 
finer than the current grid. Thus if the time steps were all 
equal on the first, fine, grid the complete cycle of time steps 
would be At, At, 2At, 4At, 8At, etc. 


Numerical smoothing of the type defined by equation (2.19) 
cannot be applied on the coarser grids, because the resulting 


Changes would not tend to zero when (6U)? + O and the cycles 
would not converge. The only smoothing applied on coarse grids 


here is that implied by the averaging of valu £ (6u)™ 
equation (3,2), a Fr agi 


Interpolation for intermediate vertices is necessary at the 
end of calculations on the coarsest grid of a cycle, to provide 
a complete set of initial conditions for calculation on the 
first, fine, grid of the next cycle. Here we simply 
interpolate the changes linearly. This has no effect on the 
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accuracy of the steady-state results, because only changes in 
time are interpolated. 


Smooth U, equation (2.19) 
Evaluate first-order change AU, equation (2.4) 


Evaluate total change 5U, equations (2.6) and (2.7) 


Average previous changes ÔU, equation (3.2) 
Evaluate change ÔU, equation (3.3) 
Interpolate changes 6U to h-grid 


Evaluate change 5U, equation (3.3) 


‘Average previous changes 6U, equation (3.2) 
Interpolate changes ÔU to h-grid 


Fig. 5 Cyclic procedure in multigrid scheme with Lax-Wendroff 
time-stepping. 


A multigrid cycle is summarized in the computational flow 
diagram sketched in Fig. 5. The grids in a cycle are denoted 
by h, 2h, etc, where h is a typical cell dimension on the 
finest grid. The overall multigrid scheme has a progressive 
character, as indicated in the flow diagram sketched in Fig. 6; 
the number of grid levels in a cycle is denoted by 2, and the 
number of cycles at a fixed number of levels by i. The 
calculation begins with relatively large cells on the finest 
grid in each cycle, together with relatively few grid levels 
per cycle. Extra levels are added progressively in the course 
of the calculation, with each additional level having a finer 
grid than the finest hitherto. Linear interpolation is used 
when adding an extra level. 
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Interpolate 
to finer grid 


ith l-level 
rid 
cycle 


Fig. 6 Progressive multigrid scheme 


3.3 Multigrid with Lax-Wendroff and multi-stage time-stepping 


In the previous section a multigrid scheme was described in 
which Lax-Wendroff time-stepping is used at every step of a 
multigrid cycle. It is possible also to use multi-stage time- 
stepping at every step of the cycle but, as noted in the 
Introduction, it is found that a composite multigrid 
scheme, with both Lax-Wendroff and multi-stage time-stepping, 
gives converged results in less computing time than either 
Lax-Wendroff or multi-stage time-stepping on its own. 


: 


For the composite scheme, the Lax-Wendroff time-stepping of 
section 2.2 is retained for the calculation on the first, finest 
grid of the multigrid cycle, but for each of the coarse grids 
a four-stage time-stepping algorithm similar to (2.12) is used. 


The algorithm includes a forcing function P and consists of the 
stages 
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19 oh 

i. = us? va [ ays?) + AtP] 

phe = uf? val aufi + AtP] 

ui?) = ups va [au + Ate] } (35) 
of) uf? vf avs?) + AtP] 

ae oe | 


+l ntl n 
su = n On 
where Oe a and %3 are constants and v is the Courant number. 


The residual avi?) at the vertex 1 in Fig. 4 is given by a 


formula analogous to (2.6), namely 


AA. + AU, . OA, + AU, » ÔA 


uu.) eee 7 CAERA) 3 3 5 5 
Se E aa ar AS E 
2h'1 DA, + OA, + Oa, + a, 


where the subscripts 7, 9, 3 and 5 refer to coarse-grid cells 
with 'centres' at 7, 9, 3 and 5 respectively, and the first- 
order changes Au, AU: etc, are given by the formula analogous 


to (2.4). The forcing function P is given by 


ws, avi?) 
se | gah) epee! 367 
P= (vit Meu een tore 


where avs?) is given by the formula (3.6); Au, is given by 


(cf equation (3.2)) 


(80) ae oa, + (60) g a, + (80) (+ oa, + (80) p 4a, s! 


(Au,), = 
hil AA, + AA + da, + AAD 
(3.8) 
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i : 
where (8U), =] CAR + (SU), + (6U,), + (60,) 4) i 


and AR.. is a stabilizing term given by (Fig. 4, cf equation 


2h 

(2.10) 

(OR), = RAR, - OF) (Yg - Yg) + (AF, - AFQ) (Y3 ~ Y7) 

+ (AG, = AG,) (xg = x5) + (AGg = AG.) (x, =x) (4AA) 5 (3.9) 


where AF, =[ (9F/2U) . 6U,] 31 ete, and AA, is the area of the 
cell 3579. Note that when Su, >oO 


(0) 
AU 
2h 
PO Te 


and hence 


Uh > Yh 
as required. In addition, the forcing function is intended to 
transmit changes from the previous h grid; substitution for P 


as given by (3.7) into the expression for i in (3.5) yields 


(1) _ io AUA 
Uh = Uy + va, dt DAE] + ARO, , 


and it can be seen from (3.8) and (3.9) that the terms within 
the square brackets constitute a residual given by changes on 
the previous grid. 


The other features of the multigrid scheme are 
straightforward. The time steps for a coarse grid are now vAt 
where v is the Courant number and At is the Lax-Wendroff time- 
step, as given by equation (3.4) in section 3.2. As with 
Lax-Wendroff multigrid, no numerical smoothing of the type 
defined by equation (2.19) is applied on any coarse grid, and 
coarse-grid changes are interpolated linearly to obtain a 
complete set of initial conditions for calculations on the 
first, fine, grid of the next cycle. The overall multigrid 
scheme is identical to that sketched in Fig. 6. 


Of course, the procedure followed in each cycle differs from 
that sketched in Fig. 5 for the Lax-Wendroff multigrid; a 
multigrid cycle of the composite scheme is summarized in the 
flow diagram sketched in Fig. 7. 
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Smooth U, equation (2.19) 


Evaluate forcing function P, equation (3.7) 
2h | Evaluate change 5u5;", equations (3.5) 


Interpolate change U to h -grid 


4h 


Fig. 7 Cyclic procedure with Lax-Wendroff and multi-stage 
time-stepping 


4. NUMERICAL RESULTS 


4.1 Scope of calculations 


The present Lax-Wendroff method has been used to calculate 
the flow of a perfect gas, with y = 1.4, past the RAE 2822 and 
NACA 0012 aerofoils for a variety of conditions. These are 
listed in Table 1, which shows that for each free-stream 
condition (M, and incidence) at least two calculations were 


performed. For the cases marked by an asterisk the composite 
method was also used to calculate the flow. For all the listed 
calculations the far-field boundary was located at a distance 
of approximately eight chord-lengths from the aerofoil. Other 
calculations not reported here have been performed for 
distances ranging from 4-16 chords; no significant changes in 
the results were found for distances beyond eight chords and, 
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indeed, for subcritical flows no significant changes were 
found over the entire range. 


[i 
jl ne 


Detail at trailing edge 
rofoil, 128 x 16 cells 


Detail at leading edge 


Fig. 8 C-mesh in neighbourhood of ae: 
in total. 


For each aerofoi 


the resulting flow is everywher 
provide a measure of accuracy in the absence of shock waves, 


where comparisons with solutions of the potential equation can 
be especially useful. The flows in the other two cases are 
transonic; the cases are among those selected for a recent 
AGARD test (FDP WG-O7. Report in preparation), and each 
presents a challenge. The RAE 2822 is an aerofoil with a 
leading-edge shape that rapidly accelerates the flow over the 
nose and a rear camber that induces substantial flow gradients 
locally. At the chosen free-stream condition there is in 


addition a large region of supersonic flow over the upper 
surface terminated by a strong shock wave. Thus this case 
provides a test of capability for dealing with strong shock 
waves and with substantial flow gradients at nose and rear. 


1 the first case is a condition for which 
e subsonic, and is included to 
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The transonic case chosen for the NACA 0012 aerofoil tests 
another aspect of the capability for dealing with shock waves. 
There is a weak shock on the lower surface (as well as a 
moderately strong shock on the upper surface) and part of the 
challenge is to capture and reproduce the weak shock as well as 
possible. All these aerofoil calculations have been performed 
on a 'C'-mesh generated by use of a modified version of a 
program provided by Rizzi (1981). The central part of a 
typical mesh is shown in Fig. 8. 


4.2 Converged results and their accuracy 


In Figs. 9 to 12 converged results for the lift and drag 


coefficients, C, and Cyr are shown for all the cases listed 


in Table 1. To show the variation with cell size the results 


are plotted against (Gere x 104, where (As x 107) is an 
average cell dimension expressed as a percentage of the 
aerofoil chord. The results are taken to be converged when 
there is no further change in the fourth significant figure, 
for Cy over at least 100 multigrid cycles. It is desirable 
not only that the values of C, and Cp should tend to the exact 
solution as As > O but that the slope of the curve should be 
small. 


Key:- © Present method 
® BGK (1972) 
+ Baker (1979) 
Lytton (1984) 


lAs}? x 10° 


0.002 


0 2 4 6 8 10 12 14 16 18 20 
(As)? x 104 


Fig. 9 Variation of C, and C_ with (As)? mae obo È 
incidence = 1°; D 2, M, = 0.676, 
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Key © Present method ` 
0.34 © BGK (1972) 
® ARC 2D (algebraic grid) 
x ARC 2D (Laplacian grid) 


(Ms)? x 104 
0.006 : 
Co 
0.004 
0.002 
0 2 4 6 8 10 12 14 16 18 20 
(As)? x 10° 


Fig. 10 Variation of C, and cy with Se NACA 0012, 


i= 0.63, incidence = rg 


Figs 9 and 10 show results for the two subcritical cases, 
for the RAE 2822 and the NACA 0012 aerofoils respectively. 
Included for comparison are values of C, from solution of the 


potential equation by the well-established methods of Bauer, 
Garabedian and Korn (1972) (BGK) and Baker (1979) (Fig. 9 only), 
and from a solution of the Euler equations by a recent method 
of Lytton (1984) (Fig. 9 only). Also included in Fig. 10 are 
the only available variations with As of the solution from 

an alternative method (Flores et al, 1984) (ARC 2D), due to 
Steger and Pulliam, for the Euler equations. It can be seen 
from the figures that the present method gives results that 
closely approximate the solutions of the potential equation 


when a fine grid (As= 0.01, 192 cells around the aerofoil) is 


used. Even with 96 cells the method may prove to be accurate 


enough for some purposes, and this possibility is discussed 
when we deal with the practical calculation of supercritical 


flows. 


Figs. 11 and 12 show results for the two supercritical cases 
listed in Table 1. Included are more results obtained by the 
use of the ARC 2D method (Fig. 11) and the method of Lytton 
(1984) and a recent solution of the Euler equations by Jameson 
(1984), (Fig. 12). A feature of these supercritical results 
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is their scatter; the differences between different methods 
appear to be as large as the differences between coarse-grid 
and fine-grid results from the present method. To seek an 
explanation we can examine local Mach numbers, which are well 
correlated with the corresponding pressures but seem a more 
sensitive indicator of the differences between methods. In 
Figs. 13 to 15 distributions around the aerofoil surface of 
Mach number and (Fig. 14) total pressure, given by the present 
method (with 192 cells around the aerofoil), are compared with 
corresponding distributions provided by Lytton (160 cells 
around aerofoil). The latter are shown in full line, the 
former in dotted line. The appropriate Rankine-Hugonoit jumps in 
Mach number, for normal shock waves, are marke on the Figures 
and denoted by the letters RH. 


Key:- © Present method 
a Lytton (1984) 
œ ARC 2D (algebraic grid) 
1.16 x ARC 2D (Laplacian grid) 
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(As)? x 10° 
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ig. cent $ 2 
Fig. ll Variation of Ci and Cp with (ås)“. RAE 2822, Mo = 0.75, 


incidence = 3°. 
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Key: © Present method 
Lytton (1984) 
a Jameson (1984) 
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Fig. 12 Variation of C, and Cp with (ayes NACA 0012, 
M, = 0.80, incidence = 125°. 


Figs. 13 and 14 help to explain the differences in lift and 
drag shown in Fig. 11 for the RAE 2822 aerofoil. The present 
method gives a significantly higher Mach number over the whole 
of the upper surface, with the difference being pronounced 
near the leading edge. This is sufficient to outweigh the 
additional lift, in Lytton's distribution, from having a shock 
wave that is both steeper and slightly further aft. The total 
pressure distributions in Fig. 14 indicate a possible cause of 
the difference in levels of Mach number. The total pressure Po 


is given by 
= yip 
P» pil + 7 M 
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and, in an exact solution, the quantity plotted (i= Po) would 


be zero forward of the shock. The distributions show that 
there are much larger errors near the leading-edge in Lytton's 
result. These may be sufficient to account for the difference 
in Mach number. Incidentally, such errors may also account 
for the difference shown in Fig. 9 between Lytton's C, and the 


values from potential flow solutions. 


Key: ——— Lytton (1984) 
---- Present method 


0 0.2 0.4 0.6 0.8 1.0 


Fig. 13 Comparison of Mach number distributions. RAE 2822 
tig 0.75, incidence = sas a 
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Key —— Lytton (1984) 
---- Present method 


Fig. 14 Comparison of total pressure distributions. RAE 2822, 
M, = 0.75, incidence = 3°. 


Fig. 15 helps to explain the differences shown in Fig. 12 for 
the NACA 0012 aerofoil with M, = 0.80. The differences in 


the Mach number distributions are small compared with those 
shown in Fig. 13, and yet they are sufficient to give a 
difference of nearly 2% in lift. Here, the differences are 
seen to be mainly at the shock, which is captured better by 
Lytton, and in the Mach number levels, which might again be 
attributable to differences in the magnitudes of the errors 
generated near the leading edge. 
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---- Present method 
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Fig. 15 Comparison of Mach number distributions. NACA 0012, 
ni 0.80, incidence = 1.25 . 


4.3 Convergence histories for present calculations, and 
practical implications 


An extended convergence history for C. 


and 109,0) 


L At 


mean 

is derived from the 
mean 
change 6p at a vertex over the local time step At for the finest 
gxid of each cycle, with the mean taken over the entire field. 
Variations with time, in CPU seconds (CRAY-1S), are shown*. 
Each arrow on the residual history curve points to a stage of 
the calculation over which the number of grid levels is kept 
constant (the stage begins with a sharp peak in the residual) ; 
the corresponding caption gives the number of cycles in that 
stage, the number of levels, and the cycle rate. It appears 
that in each stage the residual falls at a roughly uniform rate 
following an initial rapid fall; there is no sign of any 
levelling off. An examination of the convergence history for 


is shown in Fig. 16. The residual Pa 


* All calculations were performed on the CRAY-1S computer at the 
Royal Aircraft Establishment, using the vector-processing 
capability of the machine. The CPU times given in Fig. 16 to 20 
were all obtained with the Lax-Wendroff multigrid scheme. 
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C shows that there is no need in practice to drive the 

residual down to 10 say. In this example the error in lift 
= - hall ball 

acy where Ac, Ic, ce| and ci, is the converged result (to 


five significant figures), is within 0.5% of ci after about 
20 seconds, when the residual is just below lo ~. The marks 
A, B and C on the Ci history are the times beyond which the 
error is less than 1%, 0.5% and 0.1% respectively. The same 


marks are used in the histories that follow. 


A B c 
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(o uE 
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Fig. 16 Extended convergence history for C, and 109,,|60/At| nean 
RAE 2822, M, = 0.75, incidence = 3°. 


Convergence histories with an expanded time scale, for Cu cy 
and the residual, for the RAE 2822 at M, = 0.75 and the NACA 


v 
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zÉ 


igs. 17 and 16 respectively. 
histories for calculations that terminate with 
cells on the finest grid, we show 
terminate with 4~-level cycles 
inest grid (in dotted line). Por 
, 0.5% and 0.18 error marks are 
t can be seen that for both 

c and a rapid approach, initially, 
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towards the steady state and little further change thereafter; 
the 18 error mark in C, is reached in about 5 seconds and 14 
seconds respectively. The convergence histories for the 
calculations with 128 x 16 cells follow the histories for the 
calculations with 256 x 32 cells fairly closely but the various 
error marks in C, and C, are much sore rapidly approached. 
A 
a"? 8 
[E] 
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Fig. 17 Convergence history for and 
RAE 2622, M, = 0.75, pale 5 S 109101 4°/* hasan 
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Since the converged values of C, and C, appear to differ, in 


taking 128 x 16 rather than 256 x 32 cells, by no more than the 
difference between corresponding results from different methods 
the practical advantage in confining calculations to the 
coarser grid seem obvious: not only would there be a 
appreciable savings in computer time, there would also be 
considerable savings in computer memory requirements. To 
confirm this assessment we consider the corresponding coarse~ 
grid distributions of Mach number. 


see 


Keyi- = Finest grid 294 w 22 celis, levels 
. -=-= Finest grid 120w 16 cette, 4 levels 


R * » 
CPU time » seconds CRAT- 1S 


Fig. 18 Convergence history for C,, Cp and [407A | nsan 
NACA 0012, M, = 0,00, incidence = 1.3%. 


In Pigs. 19 and 20 comparisons 
from converged, S-Level, 256 x 32 cell solutions with oii 
distributions obtained after 100 4-level cycles shara the 
finest grid consists of 128 x 16 cells. cing fron wnifors fiov. 
reach the latter is 3.6 seconds CPU, starting 
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Converged 5-level solution, 256 x 32 cells max 
---- After 100 4-level cycled, 128 x 16 cells max 
3.6s CPU CRAY-1S. 


Key: 


0 0.2 04 2 0.6 0.8 1.0 
Fig. 19 Mach number distributions, RAE 2822, Mo = 0.75, 
incidence = 3. 


Fig. 19 shows that for the RAE 2822 aerofoil at at 0.75 the 


128 x 16 grid is adequate; the difference between solutions on 
the 128 x 16 and 256 x 32 grids is small and, indeed, is 
smaller than the difference between methods shown earlier in 
Fig. 13. The difference that does exist is mainly attributable 
to a wider spread of the shock on the coarser grid. There is, 
in fact, no significant difference, graphically, between the 
Mach number distribution shown, for 128 x 16 cells at 100 
cycles, and the converged solution on the same grid. This 
might be expected from the position in this case of the 1% 
error mark A' - at about 3.2 seconds (Fig. 17). The same 
conclusion regarding the adequacy of the grid can be drawn for 
the other case, with the qualification that a computer time of 
3.6 seconds is no longer enough. This would be expected from 
the convergence histories shown in Fig. 18. Fig. 20 shows the 
Mach number for the NACA 0012 aerofoil at M, = 0.80. It can 
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be seen that the weak shock on the lower surface is not well 
represented after 100 cycles on the coarse grid. Included on 
the Figure are points from the converged 128 x 16 solution, 
and these demonstrate that the 128 x 16 grid is adequate for 
the capture of such shocks. In fact the distribution after 
about 7 seconds is indistinguishable from the converged 
distribution, and it differs from the 256 x 32 distribution by 
no more than the differences shown in Fig. 15. 


Key:- Converged 5- level solution, 256 x 32 cells max 
---- After 100 4-level cycles, 128 x 16 cells max 
3.6s CPU CRAY-1S 


© © Converged 4-level solution, 128 x 16 cells max 


0 0.2 0.4 4 06 08 1.0 
Fig. 20 Mach number distributions, NACA 0012, M, = 0.80, 
incidence = 1.25°. 


All the above results were obtained by use of the Lax- 
Wendroff multigrid scheme. For the cases marked by an asterisk 
in Table 1, use of the composite multigrid scheme yielded 
essentially identical converged results. However the 
convergence histories differ. Since the 128 x 16 grid seems 
fine enough for practical purposes we show in Figs. 21 and 22 
comparisons of convergence histories, calculated using this 
grid, for the lift coefficient C, of the RAE 2822 and NACA 


0012 aerofoils respectively. 


340 HALL 


Key:- ——— Composite time -stepping , v= 2.8 
---- Lax-Wendroff time - stepping 


0 5 
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Fig. 21 Convergence histories for C. 
incidence = 3 
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RAE 2822, M, = 0.75, 


CELL-VERTEX MULTIGRID SCHEMES 341 


0.40 


0.35 


0.30 


Key:- ——— Composite time - stepping, v=2.8 
---- Lax-Wendroff time -stepping 


0 5 10 
CPU time - seconds CRAY -1S 


Fig. 22 Convergence histories for Cue NACA 0012, M, = 0.80, 
incidence = 1.25. 


The scales are not the same as in Figs. 17 and 18 but are 
expanded so as to show up more clearly the differences 
between the convergence histories (in broken and full lines). 
Also shown on Figs. 21 and 22 are the 1%, 0.5% and 0.1% error 
marks denoted, as in Figs. 17 and 18, by A, B andc. It can 
be seen that with the Courant number v = 2.8 the composite 
scheme yields significantly faster convergence. If we take 
a calculation to be sufficiently well converged for practical 
purposes at the 0.5% error mark, as suggested by the above 
discussion of Figs. 17 to 20, then the computing times for 
practical convergence are: 
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Thus, results of sufficient accuracy for practical purposes 
can be obtained with a relatively coarse 128 x 16 C-mesh in CPU 
times of 3-5 seconds. It seems worth noting, however, that the 
gains over the Lax-Wendroff scheme may not always be large, so 
that the Lax-Wendroff scheme may be preferred for its relative 
simplicity. 


5. CONCLUDING REMARKS 


The motivation in the present work has been the achievement 
of high accuracy at low cost in computational effort. The cell- 
vertex formulation of Ni (1981) and Denton (1981) has been 
adopted as a starting point because with it second-order 
accuracy may be easily obtained in principle, even for non- 
smooth grids. In achieving this accuracy, for highly stretched 
grids and at body boundaries, some extensions have been made, 
so that cell-vertex schemes now provide a firmer basis for 
practical calculations. To accelerate the convergence to the 
steady state, time-stepping multigrid schemes of Lax-Wendroff 
and multi-stage (Runge-Kutta) type have been developed. An 
alternative approach for hyperbolic problems has been suggested 
which has provided some guidance in the derivation of 
corresponding multigrid algorithms. Multi-stage time-stepping 
had been introduced by Jameson et al (1981) in a cell-average 
formulation of the Euler equations. Here a four-stage scheme 
has been coupled with the cell-vertex formulation and the 
scheme has been stabilized by the addition of a second-order 
time-change rather than a fourth-order dissipation. It has been 
demonstrated that use of the four-stage time-stepping for the 
calculations on the coarse grids in the multigrid cycle gives 
significant reductions in overall computing time. 


While all the above improvements seem worthwhile, practical 
measures of accuracy and speed must include systematic 
comparisons with other methods and such comparisons have yet 
to be made. In the meantime there remains considerable scope 
for further development. 


First, the accuracy of cell-vertex schemes does not appear 
to have been seriously tested for grids that are not smooth. 
Such grids are increasingly encountered as more complex 
configurations are treated. 


Next, the capture of shock waves is open to criticism on 
more than one point. Empirical smoothing is applied, so that 
the strength and position of a captured shock, and the 
magnitide of the associated spatial oscillations that arise 
numerically, are sensitive to variations in the ‘constants’ 
adopted for the smoothing formula. This is, of course, a 
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feature common to all shock-capturing schemes that are not of 
the upwind type. Here the constants have been chosen (partly 
by comparison with results from other methods) and then kept 
fixed for all the calculations performed, whatever the 
aerofoil, the grid, or the free-stream Mach-number. This is 
better than varying the constants to suit the case, but it 
would be better still if no empirical choice were involved. 
However, even if we accept the need for empirical smoothing 
there is still scope for improvement. The formula adopted 
here is essentially that proposed by Ni (1981), which has the 
merit of being simple but is, in fact, non-conservative. A 
new, conservative, formula may give results that are less 
sensitive to variations in the smoothing constants. 


Finally, there seems to be considerable scope for 
improvement in the speed of multigrid algorithms for the 
Euler equations. Lax-Wendroff and multi-stage algorithms were 
respectively proposed by Ni (1981) and Jameson (1983). These 
were pioneering extensions of the multigrid concept to solution 
of the Euler equations, where the aim was to demonstrate 
feasibility rather than to find the fastest algorithm. It has 
been shown here that a combination of the Lax-Wendroff and 
multi-stage algorithms gives a significant reduction.in 
computing time, but no attempt has been made to provide 
theoretical support, which might show how the composite scheme 
could be optimized, or to explore completely different 
alternatives. 
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and 
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l. INTRODUCTION 


Let a discretisation of a partial differential equation be 
denoted by 


Ay =b (lil) 
This is a system of linear equations. 


Stationary single-step iterative methods for solving (1.1) can 
be written as 


ytl = y” + Bib - ay”) (1.2) 


where the matrix B, which often is not formed explicitly, 
depends on the iterative method that is chosen. 


If the discretisation is nonlinear, it is assumed that some 
linearisation leads to a linear system (1.1). The iterations 
(1.2) will then be inner iterations within an outer iteration 


on the nonlinearity. 


In practical applications the number of unknowns in (1.1) 
can be quite large, and simple and straightforward iterative 
methods normally converge slowly. A significant recent 
development in numerical mathematics is the rise of 
preconditioned conjugate gradient methods (CG) and multigrid 
methods (MG). With respect to linear problems, both can be 
regarded as acceleration techniques for basic iterative methods. 
This provides us with a unifying point of view. 


348 SONNEVELD, WESSELING AND DE ZEEUW 


The purpose of this paper is to briefly review CG and MG 
from the viewpoint just mentioned, and to introduce the reader 
to the literature. 


For completeness it should be mentioned that MG can also 
be applied directly to nonlinear problems, and this may lead 
to a more efficient solution method than linearisation 
followed by application of a fast linear solver. See 
(Hackbusch and Trottenberg, 1982) for a general introduction to 
MG. Up-to-date accounts of CG are given in (Hageman and Young, 
1981; Golub and van Loan, 1983). 


2. BASIC ITERATIVE METHODS 


"Basic iterative method" (BIM for short) is a loose 
appellation for methods of type (1.2) that are simple and easy 
to implement. For a review of BIMs (including some that are 
not so simple) see (Young, 1971, or Varga, 1962). Examples are 
the Gauss-Seidel (GS) and SOR (successive over-relaxation) 
methods. The computational complexity of these methods for a 
typical problem such as the two-dimensional Poisson equation 


is otni) and omn?/2) , respectively, with N the number of grid 
points. These numbers explain both the popularity of SOR in 
the sixties, and the large computer times needed for large N. 
Accelerated by CG or MG, the computational complexity becomes 


5 
O(N /4, (conjecture) or O(N), respectively, as will be 
discussed later. 


The most popular BIMs used with CG and MG are GS methods and 
incomplete factorisation (IF) methods. 


The reader is assumed to be familiar with GS methods. They 
come in various orderings (lexicographic, red-black etc.) and 
may be pointwise or blockwise (by lines or planes mostly). we 
give a short outline of IF methods. 


An incomple LU-factorization (ILU) of the matrix A consists 
of a lower triangular matrix L and an upper triangular matrix 
U such that 


We=A+C (2.1) 


where C represents the error matrix. With c = 0 (complete 
factorization) L and U usually are much less sparse than A, 
which leads to large storage and computer time requirements. 
By allowing C # O sparsity of L and U can be obtained. For 
example, if A represents a typical two-dimensional 7-point 
finite difference stencil of a second order elliptic partial 
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difference operator on a rectangular grid, the sparsity 
patterns of L, U, A and C could be as in Fig. 2.1. 


Fig. 2.1. Sparsity patterns. - - ~- : c 


For simple formulae for the computation of L, U and C on a 
rectangular computational grid and theoretical results on the 
existence of L and U see (Meijerink and van der Vorst, 1977, 
1981, Wesseling, 1982A, Sonneveld et al., 1985). 


Equation (2.1) leads to the following iterative method for 
solving (1.1): 


yrth = y” + (uu) Mtb - ay”) (2.2) 


Using (2.1) this can be rewritten as 


itt = (nu) 72 + cy”) (2.3) 


Hence, A is not needed, and L and U can be stored at the 
location of A. Usually C is not stored but computed. Hence, 
no extra storage is required. Because C is very sparse 

(cf. Fig. 2.1) the right hand side of (2.3) comes cheap. 
iteration with (2.3) takes 18N operations, generation of L 
and U takes 21N operations, with N the number of unknowns: see 


for example (Wesseling, 1982A) . 


One 


factorization (IBLU or ILLU, incomplete 
follows. On a rectangular 
l lines the matrix A has 


Incomplete block LU- 
line LU-factorization) of A goes as 
grid with n vertical and n horizonta. 
the following structure: 
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Ae (2.4) 


with Lye BL and U; mx m matrices; B; is tridiagonal; L and 


U are lower and upper bidiagonal, with sparsity patterns 


{(j,3-1), (j;j)} ana {(j,4), (i,j+1)} respectively. There exists 
a matrix D such that 


A = (L +D)D -(D + U) (2.5) 
where 
o 
o uy 
L 
PO o U, 
L= RSO uss a ' 
SN TE o 
Ua-1 
Lo X 
D 
l 
D, 
D= 3 
D 
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Equation (2.5) is sometimes called a line LU-factorization 
because the blocks in L, D and U correspond to (in this case 
horizontal) lines in the grid. Equation (2.5) can be rewritten 
as 


A=L+D+U+ID'U (2.6) 


One finds that to tu is the following block-diagonal matrix: 


1 . (2.7) 


DaPn-1 n-l 


From (2.6) and (2.7) we deduce the following algorithm for the 
computation of D: 


-1 
= = = = 2,3,006 
D) = Bye Dy = By > BP Mya Lis 2/37e~en 


(2.8) 
The matrix pot is a full m x m matrix, which causes the cost 
pS 


3 
of a line LU-factorization to be O(nm’), as for standard 
LU-factorization. An IBLU factorization is obtained if we 
replace L Die 1 by its tridiagonal part. Thus, algorithm 
Pha on tics 


(2.8) is replaced by: 


D D j E E A E (2° 
D = Bur D; = B; - tridiag (DL! i-r’ popecer 
The IBLU factorization of A is now defined to be 
As (L+D)D> (D+0) +R (2.10) 


with R the error matrix, and Ď the block diagonal matrix with 


blocks D,. 
i 


Detailed formulae for the computation of D are given ages 
(Concus et al., 1985; Sonneveld et al., 1985; Axelsson et èt., 


1983; Meijerink, 1983; Underwood, 1976). 
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From (2.10) the following iterative method is obtained: 


n+l 


(L + DD L(D + U)sy =b- Ay", y = y" + dy” (2.11) 


The cost of one iteration with (2.11) is 37N operations; the 


cose of D and its triangular factorization is 36N operations. 
Storage of 6N reals is required for the triangular factoriza- 


tion of D and for D. For details see (Sonneveld ed al., 1985). 


Both ILU and IBLU can be implemented efficiently on vector 
computers: see (van der Vorst, 1982, 1985; Meurant, 1984; 
Hemker et al., 1983). Variants of ILU are discussed by Behie 
and Forsyth (1984), Gustafsson (1978) and Manteuffel (1979). 


Extension of ILU to three dimensions is straightforward, and 
has been tested in combination with CG and MG by Behie and 
Forsyth (1983). A three-dimensional version of IBLU has been 
proposed by Meijerink (1983) and Kettler and Wesseling (1985). 


3. CONJUGATE GRADIENT ACCELERATION OF BASIC ITERATIVE METHODS 


For an introduction to conjugate gradients (CG) (and 
A Tchebychev) methods, see (Hageman and Young, 1981, or Golub and 
van.Loan, 1983). Here only a brief outline is given. 


One way to look at CG is as follows. Assuming the matrix 
A to be large and sparse we allow ourselves to use A only for 
multiplication with vectors and nothing else. This means that 
polynomials in A can be built. A rather general algorithm then 
is 


n+l n n 
Payta ph p= 0 AO (3-1) 


Here a isa ney to be Peg ogre aa is a polynomial of 
degree n, and z =b- ay? is the uae residue. We find 


n+l n+l o 
em bray” = $4, ANE (3.2) 


with 
$400) = T- ALGO (A) +a 10 (A) te. Ha Otay)? (3.3) 
Hence 


1 
6,6,» (3.4) 


1 
1, = {¥,|¥,(0) = 1, W is polynomial of degree < n} (3.5) 


i 
t 
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anes distinguishing property of OG is that oy is constructed 


1 
Hemel < [yell Y EG (3.6) 


Depending on the choice of norm, different CG variants are 
obtained. 


By itself CG does not achieve an impressive rate of 
convergence. It comes into its own when it is combined with 
preconditioning. Let B be the preconditioning matrix. Then 
a preconditioned CG method is given by: 


po eto. 2° =b HAY» 
T 


n-1 eS | 
p = Pa Pn’ pe Br, 


n 
p =Br +Bp , 
p a (3.7) 


n fi n° on 
y"y +07, =p /o., 0," P Me 
zetl if 2 t aap” 
see (Hestenes, 1956). With B = I one obtains an un- 
preconditioned CG method. In this case the norm in (3.6) has 


been chosen as follows: 

| |x| \? = aly (3.8) 
Now we will explain why (3.7) can be regarded as an 
acceleration of the BIM (1.2), under the assumption that B is 
SPD, so that one may write B = EE’, By substitution of 
r =E}, p= Ep y = Ey one finds that (3.7) is CG applied to 


the preconditioned system Ay = b with b = Eb, Aa" ET AE. 
According to (3.2), (3.6) we have 


Bx" = $,,(B"RE) ETE? (3.9) 


with 
||¢,, (E"aE)E"z°] | < [lyp (E aE) E" | 1 V¥, € a (3,10) 


From (1.2) it follows that 
Tn. pt Tje TT, eee t (3.12 
Er =E Yp (ABE )E E ron” (1 - x) a (3.11) 
since E'(aee") e T = (g7aE)* , V k eq. (3.11) can be 
rewritten as š 
T (3.12) 
Br” = y (E AE)E r? 
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Comparison of (3.9) - (3.12) shows that (3.7) will converge 
at least as fast as (1.2). The computation of Br” in (3.7) is 
performed by means of one iteration with (1.2), since 

Seay alee (3.13) 


The rate of convergence of CG has been studied in a number 
of publications. From (3.6) one may deduce, as in (Daniel, 
1967), that the required number of iterations n to reduce the 
residue by a factor e€ satisfies 


4s 


1 € 
n> z in 7 cond, (BA) (3.14) 


with cond, the condition number in the Euclidean norm. For 


a detailed study of the rate of convergence see (van der 
Sluis and van der Vorst, 1985). 


An effective and popular preconditioning is provided by 
incomplete Cholesky factorization, leading to the ICCG method, 
proposed by Meijerink and van der Vorst (1977). Its 
effectiveness is explained by taking a look at the spectrum 
of BA. Gustafsson (1978) proves that with a modified form of 
incomplete Cholesky factorization for the Poisson equation in 
two dimensions 


cond, (BA) = O(1/h) (3.15) 


so that because of (3.14) the computational cost of onb), 
with N the number of gridpoints. Compared with older methods, 
very impressive rates of convergence are obtained with ICCG 
for a number of difficult problems by Kershaw (1978). An 
implementation leading to greater efficiency is given by 
Eisenstat (1981). 


The restriction of CG to SPD problems is of course a 
severe one. Various CG type methods have been proposed for 
non-SPD systems. An example is the CGS (conjugate gradients 
squared) method, proposed by Sonneveld (1984). This method 
is defined as follows, in its preconditioned form: 
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fe) oO = 
=b-ay, g* =n =0, 
vest tgh, diauteiagh +n), 
3.16) 
+1 41 ( 
not? = u- a Bag”, yr sy +a (ut acy) 
+ 
Pra Polia (ue ne) i 
with 
Snir Pata PIBE TI a7 Bere ý 
T T 
=o z0 n 
p = BE p oTr BAg 


where z is a vector to be chosen. 

For a justification of this method see (Sonneveld, 1984; 
Sonneveld et al., 1985). It is related tọ the biconjugate 
gradient (bi-CG) method of Fletcher (1976). With bi-CG 


one has 


apis $ (BA) £° (3.17) 
With CGS one obtains at almost no extra cost 
x" = ¢2(payr® (3.18) 


The polynomials in (3.17) and (3.18) are the same. It follows 
that if bi-CG converges, then CGS converges faster (the S in 
"CGS" is inspired by the squaring of $,) . Furthermore, unlike 


T 
bi-CG, CGS does not use A . 


Bi-CG and CGS are but two examples of extensions of CG to 
non-SPD systems. We will not review other extensions that 
have been proposed, but restrict ourselves to mentioning the 
publications of Concus and Golub (1976), Vinsome (1976), 
Widlund (1978), Axelsson (1980), van der Vorst (1981) and ne 
and Jea (1980). A somewhat related class of methods si forme 
by the ‘chebychev methods: see (Manteuffel, 1977, 1978; 
Hageman and Young, 1981). These methods can be very efficient, 
but their effectiveness depends on the choice of certain 
parameters. 


non-SPD case the theory is far 


in the 
Unlike the SPD case, e kaa rrule T 


from complete. For CGS applied to a gener: 
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thumb is, that with ILU or IBLU preconditioning good 
convergence may be expected if A satisfies 


Pie Aer bs ak ie (3.19) 


Sonneveld (1984) and Sonneveld et al. (1985) estimate the 
cost of a CGS iteration to be 60 flops (ILU preconditioning) 
or 88 flops (IBLU preconditioning) per grid point. 


4, MULTIGRID ACCELERATION OF BASIC ITERATIVE METHODS 


The basic ideas of MG are quite general and have wide range 
of application, including other fields than partial 
differential equations. MG can be used not only to accelerate 
iterative methods, but also, for example, to formulate novel 
ways to solve nonlinear problems, including bifurcation 
problems and eigenvalue problems, or to devise algorithms 
that construct adaptive discretizations. The volume edited 
by Hackbusch and Trottenberg (1982) presents a useful survey 
of all aspects of MG. See also (Brandt, 1984). 


We restrict ourselves here to the one aspect of MG 
mentioned in the title of this paper. This makes it 
possible to simplify MG, and to distinguish situations where 
its effectiveness is guaranteed. The significance of MG 
as an acceleration technique springs from the fact, that in 
principle a computational complexity of O(N) can be achieved. 
This has been proved rigorously under quite general 
circumstances; see (Hackbusch, 1982) for a survey of MG 
convergence theory. Work in this area is still going on. 


If the BIM (1.2) converges, it usually has the property, 
exploited by MG, that the non-smooth part of error and 
xesidue is annihilated rapidly, whereas it takes many 
iterations (more and more as the mesh size decreases) to get 
rid of the smooth part. The property of smoothness will be 
defined precisely shortly. The fundamental MG idea is to 
approximate the smooth part of the error on coarser grids. 
In the MG context the BIM (1.2) is called a smoothing process. 


A two-grid method can be defined and explained as follows. 


Let G be a fine grid and G C G a coarse grid. Let the sets 


ot grid functions G > R and G >R be denoted by Y and Y, 
respectively. Let the coarse grid approximation of (1.1) be 
given by 
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Ay =b (4.1) 


Furthermore, let there be given a prolongation operator P 
and a restriction operator R: 


a E chit Baie Sea i (4.2) 


A two-grid method for the acceleration of the BIM (1.2) 
j 


can be formulated as follows. Let y` be the current iterand, 


4i 
and let y? be the result of applying a coarse grid correction 


to y: 
Hy j ath 
yi = y? + PAR - ay?) (4.3) 
where we assume for the time being that the coarse grid 
problem is solved exactly. For the residue r =b- ay? 
we find: 
tt 2 (r - apa tax? (4.4) 


We now make the following choice for A, called Galerkin 
approximation: 


A = RAP (4.5) 


A more obvious choice of A would be to discretise the 


differential equation on the coarse grid G , but 
(4.5) leads to a more elegant exposition of MG principles, 


and has other advantages as well, to be mentioned later. It 
now follows from (4.4) that 
x? € xKer(R) (4.6) 


as noted by Hemker (1982) and McCormick (1982), In other 


words, x"? L Ker (R), which justifies the appellation 
"Galerkin approximation" for (4.5). Following Hemker (1982) 
we relate the concept of smoothness to Ker(R): 


Definition 4.1 The set of R-smooth grid functions is 


Ker! (R) x 


vely, since in practiœ 


This definition makes sense intuiti 
es with positive weights 


R consists of taking weighted averag 
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over the fine grid, so that a grid function in Ker (R) has 
many sign changes, and hence may be called non-smooth. 


It remains to annihilate the non-smooth part of r, and 
this is done in the second part of two-grid iteration, called 
smoothing. This is done with the BIM (1.2): 


sah = yp? + B(b - ay!) (4.7) 


and we find: 


vt j+: 


= (I - AB)r (4.8) 


The projection operator on Ker(R) is I - R (RRT) R, and we 
conclude from (4.8) and (4.6) that 


jts 


Ott. (1 - AB) (I - Ree’) IR) r (4.9) 


This leads us to the following definition: 


Definition 4.2 The R-smoothing factor of the smoothing 
process (1.2) is 


Pp = || (x -aB (a - R (RR) R] 

The concepts of smoothness and smoothing factor can also 
be related to the range of P, or be introduced by means of 
Fourier analysis as proposed by Brandt (1977). For 
definitions, see (Sonneveld et al., 1985). Fourier analysis 
leads to less precise exposition, but to smoothing factors 
that are easy to compute. See (Brandt, 1982, and Kettler, 
1982) for more details. 


If we postulate that 


||z - apa tR|| <1 (4.10) 


(which holds approximately if P and R are accurate enough) 
then we may conclude from (4.4) and (4.9) that 


j+ j ; 
MA s dla -aa - RM err) Ay | [||] | (4.2) 


Without coarse grid correction, the residue reduction factor 
would be ||z - aB||. Equation (4.11) shows how MG accelerates 
the convergence Of the BIM (1.2), if the BIM reduced non- 
smooth residue components efficiently. 


MULTIGRID AND CONJUGATE GRADIENT ACCELERATION 359 


Choosing A according to (4.5) has the advantage that the 
user needs to specify the matrix and the right hand side on 
the finest grid only. Hence a multigrid code can be 
programmed such that it is perceived by the user just like 
any other subroutine for solving linear algebraic systems. 
Furthermore, (4.5) provides automatically a sound 
"homogenization" in cases where the coefficients or the right 
hand side vary rapidly. With difference approximation the 
implementation of the boundary conditions on the coarse grids 
is quite often not trivial. On the other hand, depending 
on the problem, computing A with (4.5) may be more expensive 
than constructing a finite difference approximation. Also 
A has to be stored (perhaps to be overwritten at a later 
stage by its incomplete factorization). With adaptive 
discretisation (4.5) is not feasible, since one does not 
know a priori what the finest grid will be. Results of 
numerical experiments comparing coarse grid Galerkin and 
finite difference approximation are given by Wesseling 
(1982A) . 


For second order equations prolongation may be defined 
by linear interpolation. If the "full multigrid" schedule 
is used, second order interpolation is also needed at 
certain stages in the schedule: see (Brandt, 1977). For 


restriction one may safely take R = Pp, For discussions of 
various other possibilities, and comparative experiments, 
see for example (Brandt, 1977, 1982; Stuben and Trottenberg, 
1982; Wesseling, 1982A). However, when the coefficients in 
the given differential equation are strongly discontinuous 
one should use matrix-dependent prolongation instead of 
linear interpolation, in order to take into account the fact, 
that the solution is locally linear only in a piece-wise 


T 
fashion. One may still use R= P . See (Alcouffe et al., 


198l; Kettler, 1982). 


The two grid method described above becomes a 
multigrid method if exact solution of the coarse grid is 
replaced by approximate solution, using MG with coarser 
grids. In the MGD-family of MG cođes we do this with one 
two-grid iteration employing an additional coarser grid with 
doubled mesh-size, and so on recursively, until the coarsest 
grid (usually a 3 x 3 grid) is reached, where one iteration 
is performed according to (1.2). The resulting MG method 
is said to be of sawtooth type, because its schedule is 
represented in a natural way by the schematic of Fig. 4.1, 
which is a sawtooth curve. 
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fine 


Fig. 4.1. Sawtooth multigrid schedule. 
A dot represents a smoothing step. 


Various more general MG schedules have been described, 
see for example (Brandt, 1977, 1982), (Stuben and 
Trottenberg, 1982). Some comparative experiments are 
described by Wesseling (1982A). The sawtooth schedule is 
the simplest possible MG schedule. One may wonder whether 
such a simple schedule can handle a sufficiently large 
variety of cases. Experience indicates that the answer is 
affirmative, see for example the experiments carried out 
by Wesseling and Sonneveld (1980), Kettler (1982), 

Wesseling (1982A,B), Hemker et al. (1983), McCarthy (1983), 
Sonneveld et al. (1985), and the application to transonic 
potential flow by Nowak and Wesseling (1984). We think that 
with an effective smoothing process and accurate coarse grid 
approximation, a simple MG schedule suffices for linear 
problems. 


The MGD-family of MG codes consists at the moment of the 
subroutines MGD1 and MGD5. These are meant for solving 
finite difference discretisations of a general second order 
partial differential operator (possibly not self-adjoint, 
with a mixed derivative allowed) on a two dimensional 
rectangular uniform grid. The user provides only the matrix 
and the right hand side on the finest grid. Because the 
user is not allowed to influence these subroutines by 
setting parameters etc, they are called autonomous. 

MGD1 and MGD5 work like a "black box". MGDl uses ILU 
smoothing, MGD5 employs IBLU. As a result, MGD5 is more 
robust. These programs are portable. In the near future, 
MGD1 will be available in the NAG library. Versions MGD1V 
and MGD5V have been designed for auto-vectorization on CRAY 
and CYBER-205, without sacrificing much on scalar machines. 
More details, and CPU-time measurements on CYBER-170, 
CYBER-205 and CRAY-1, and some design considerations related 
to these MG subroutines may be found in (Wesseling, 1982B, 
Hemker et al., 1983, 1984, 1985, Sonneveld et al., 1985). 
The MGD software may be obtained by sending a magnetic tape 
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to the second author of the present paper. 


MG software (the MGOO package) is also made available 
by the group at GMD, Bonn, see (Foerster and Witsch, 1982; 
Stiiben et al., 1984). MGOO is for self-adjoint second 
order elliptic partial differential equations on a 
two dimensional rectangular domain. Unlike the MGD codes, 
the user is allowed some freedom in choosing the MG algorithm. 
There is a choice in MG schedules, smoothing processes and 
restriction operators. For the Poisson equation an 
especially fast version is available. A mixed derivative 
is not allowed. 


Dendy (1982) describes a "black box multigrid" method 
that can handle self-adjoint equations with strongly 
discontinuous coefficients; the software mentioned before 
can handle this only when the discontinuities occur on grid 
lines that belong to at least a few coarse grids. The user 
has a choice of multigrid schedule and smoothing processes. 


It is to be expected that results of comparative 
numerical experiments with the software mentioned above will 
become available in the near future. 


5. DISCUSSION AND FINAL REMARKS 


It will be clear from the foregoing that CG has a 
sound theoretical basis only in the SPD case. For MG 
the O(N) complexity is theoretically well-founded only if 
one regards the coefficients in the differential equation 
as fixed. However, in practice one often has a need for 
numerical methods the properties of which remain invariant 
as a coefficient tends to a limit. . Examples are the 
convection-diffusion equation at high péclet number: 


= = el 
“tai GE £ (5.1) 


with 1/e the péclet number, or the Navier-Stokes equations 
at high Reynolds number, or the anisotropic diffusion 


equation: 


- E$ ay - % 22 =f (5.2) 


) also models the effect of having a 


Equation (5.2 
ry high aspect ratio. 


computational grid with cells of ve: 
Testing in practice seems to be the only way to determine 
how a method works in cases like this. In fact, also in 

= 0(1) practical tests are needed to 


easier cases where € = 
find out what the efficiency of a method is; there may be 
between two methods that 


large differences in computer time 
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both have complexity O(N). 


Therefore there is a need for test problems of 
sufficient generality such that one may have a reasonable 
degree of confidence in those methods that have done well 
on these test problems. One important test equation is 
(5.1), with u in "all" possible directions, for example 


o ° 
u, = cosa, u, = sina, and testing with a = 0O(15 ) 345°. 


1 2 
Having too few a values in one's test set can be quite 
misleading; see the results in (Hemker et al., 1983; 
Sonneveld et al., 1985). Another important test equation is 
(5.2), with the axes rotated over an angle a, resulting in 


Phi wines Fares NP 
RUSE fa BM bgaa Eeler aeh ir lesti gain E 
(5.3) 


with c = cosa, s = sing, This introduces the additional 
complication of a mixed derivative. A mixed derivative 
occurs in practice when a non-orthogonal coordinate mapping 
is used. 


The fact that the coefficients in (5.1), (5.3) are 
constant rather than variable does not make these problems 
easier, but more difficult. It may easily happen that the 
BIM that is accelerated with CG or MG is a bad preconditioner 
or smoother for certain combinations of € and a. This has 
more serious consequences when the unfavourable ¢,a values 
Occur throughout the domain than when these values occur 
only locally. 


Suitable test problems with strongly discontinuous 
coefficients are the problems of Stone and Kershaw: see 
(Kettler, 1982) for a description and further references. 


Whether a particular MG method will work for a particular 
Problem may be predicted by Fourier smoothing analysis. 
See Kettler (1982) for an extensive catalogue of Fourier 
smoothing analysis results, 


Sonneveld et al. (1985) report on numerical 
experiments, solving (5,1) and (5.3) with six methods: 
(1): MGD1, (2): MGD5, (3) , (4): MGDl with horizontal or 
alternating zebra line Gauss-Seidel smoothing , respectively, 
instead of ILU, (5): CGS with ILU preconditioning, (6): CGS 
with IBLU preconditioning. A zebra rather than a successive 
ordering was used because of considerations regarding 
Possible use of vector computers. Equation (5.1) is 
discretised with upwind differences so that property (3.19) 
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holds. It is found that (2) and (6) work efficiently in all 
cases. The performance of the other methods deteriorates in 
some cases. As far as MG is concerned, the observed behaviour 
corresponds with smoothing analysis results, in so far as 
these are available. There are special cases in which 
methods (2) or (6) are surpassed in efficiency by one or 

more of the other methods, as is to be expected. For 
example, if £ is large in eq. (5.2) a very effective and 
simple preconditioner/smoother is horizontal line Gauss-Seidel 
(zebra or successive). If one wants to handle as large a 
class of problems as possible with a single autonomous 

(black box) code without user-provided adaptions, IBLU 

should be used for smoothing or preconditioning. 


The experiments just mentioned were performed on a 
65 x 65 grid. Computing times with CG or MG were roughly 
the same. We have found that as the grid becomes larger, 
CG starts to lag behind, in accordance with theoretical 
computational complexity results. But it should be 
remembered that CG is easier to program than MC, 
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INTRODUCTION 


We are interested in transonic flow simulation around 
realistic aircraft geometries governed by the Euler equation. 
One of the most important problems is the grid generation 
around those complex geometries. In the particular case of 
Euler equations, due to its hyperbolic behaviour, it appears 
that the generation of adapted grids is a stiff problem. Our 
global strategy is to handle Finite Element-type triangular 
unstructured grids. 


In this paper, we introduce for Euler solvers new 
improvements related to grid optimization. Indeed, in order to 
identify accurately the Characteristics of the flow, the grid 
should satisfy the following important conditions: 


(i) to take into account the geometry, body-fitted grids 


seem mandatory. 
(ii) to be fine enough in the region where the equations to 


be solved need accuracy. 


To reach objectives (i), (ii), we propose the 


procedure: 
- Step 1: An initial coarse grid Be is generated, fulfilling 


following 


only condition (i). 
- Step 2: An adaptive procedure is applied, using intermediate 


simulations, in order to derive from the initial grid Oo a new 


grid Ok which will satisfy conditions (i) and (ii). 


ibe such an adaptive 


The goal of this paper is to descr. 
-D context, keeping in 


procedure applied to Euler flow in a 2 
sight the future extension to 3-D. 
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Two families of adaptive procedures can be recognized: 


In the moving node procedure, the nodes are moved while 
their number remains constant. 


In the enrichment procedure, the location of the nodes is 
fixed but their number is increased, 


For recent Euler computations with the first method we refer 
to Nakahashi and Deiwert (1985), Dwyer (1983), Lohner (1985). 


We are interested here only in the second method. For 
solving Euler equations, this method has been recently 
illustrated by locally refined Finite volume techniques 
combined or not with multigrids, see Usab and Murman (1983) and 
Berger and Jameson (1984) for the 2-D and also Baker, Jameson, 
Vermeland (1985) in a 3-D context. However Finite Element 
Methods have also been presented by Lohner (1985) and Palmerio 
(1984). 


In this paper, the adaptive procedure has been applied to 
two different schemes; 


(i) A Galerkin F.E.M. approximation using an artificial 
viscosity, 


(2) A new second-order accurate upwind scheme with no 
viscosity parameter adjustment. 


With scheme (1), we study a realistic industrial problem to 
illustrate the ability of the method for a better capture of 
the singularities (shocks, angular stagnation point,...). 


The problem of spurious entropy generation at stagnation 
points is considered with Scheme (2). 


1. ADAPTIVE PROCEDURE 
1.1 Triangulation and degrees of freedom 
We consider a 2-D F.E.M. type triangulation of the domain 


of computation N. This means that a polygonal region a, is 


constructed, which is a "good" approximation of 2. Then a 
triangulation oF of a, is constructed, satisfying the standard 


conditions (see Zienkiewicz, 1971). 
For a given vertex i the neighbour vertices are those which 


belong to a triangle having 2 as a vertex, Let us denote by 
w(i) the number Of neighbours of a vertex i; a triangulation 
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can be unstructured in the following sense: N(i) can be 
different from N(j) for two different vertices i and j (see 


Fig. 1 Example of different number of neighbours of a vertex 
(N(iq) = 5, Nli) = 8) 


As far as mesh refinement is concerned, this means that we 
shall be able to increase locally the number of elements 
without changing the data structure describing the mesh. From 
this point of view, the unstructured grids provide more 
flexibility than F.D.M./F.V.M. ones, for which a locally 
refined mesh is described in a different manner from the 
‘initial (I,J,K)-mesh (extra loop and matching interfaces must 
exist in the computer program). 


Representation of the unknowns 


A piecewise linear continuous approximation is chosen: the 
degrees of freedom are values of the functions at nodes. The 
nodes are located at vertices of the triangulation (conformal 
element). This choice has been introduced previously in 
several papers (Angrand and Dervieux, 1984, Billey, 1984). 
Many arguments favouring this choice have been presented in the 
above papers and also in Dervieux (1985). 


1.2 Enrichment algorithm 


1. Firstly a criterion is used, to decide which triangles 
require additional effort on accuracy. For these triangles 
new points are introduced, located at the middle of each side. 


2. The identification of the sides to be divided is then 
derived in a second step. 


3. In the third step, new triangles are constructed by 
considering in the old triangles the number of sides containing 


372 PALMERIO ET AL. 


new points (see Fig. 2); three situations can appear: 


- If there is only one point added, the former triangle is 
split into two new ones. 


- If two points must be added three sub-triangles are 
created. 


- With three new points (on each mid-side), then the 
triangle is divided into four sub-triangles. 


oe | 
eee <A 
ee <A 
A - & 


Fig. 2 Mesh enrichment procedure, .formation (or not) of new 
triangles. 


1.3 Criterion of enrichment 


We consider a physical criterion highly advocated in steady 
Euler calculations. In these calculations, the entropy S should 
be purely convected, i.e. satisfying 
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> oe 
u.VS=0 (1.1) 


ee 
where U is the velocity of the flow, except in the vicinity 
of the shocks which act as sources of entropy. 


This observation leads us to define a new criterion to 

determine the region to be refined, i.e. on each triangle where 
+> 
the scalar product U.VS is greater than a constant C times the 
>_> 

maximum of v.Ws (taken over all the triangles). 

As a consequence, the regions of refinement will locate the 
elements where spurious numerical entropy production occurs 
and also the physical entropy production on shocks. C is a 


parameter controlling the level of refinement that we wish to 
perform. 


2. APPLICATION TO EULER FLOWS 

We present some experiments of the above methods using a 
currently used Galerkin scheme. In a second section we present 
experiments with a new upwind scheme much more suitable for 
adaptive enrichment. 


2.1 A Richtmyer-Galerkin scheme 


The Euler equations are denoted by: 


w, + (FO), + (60), = 0 (2.1) 
pu pv 
2 + uv 
a Pl py ean =] PMS (2.2) 
puv pv +p 
(e+p)u (e+p)v 
p= ty - nie - Fete tv) payee eat 82" 3} 


where W(x,y) are the usual conserved quantities (p,pu,0v,e), 
and energy; u and v are the velocity 


i.e. density, moments, 
e the total energy and y the 


components, p the pressure, 
specific heat ratio. 
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An explicit Richtymer-Galerkin scheme is constructed; it 
consists of a predictor-corrector sequence (see Angrand and 
Dervieux, 1984, Angrand et al., 1983). We recall the two steps 


of this scheme in which Va is the standard Galerkin space of 


approximation: 


VAT { continuous, and linear on each triangle of ot 


1. Predictor step: for each Ta triangle of Our 


nde CAEN di 
Wr) area (T) ME du Ho lae [lewy + ewy] o 
Th ani (2. 4) 
2. Corrector step: We Vv, 1 VOEV, 
wet af w S 
I Ete to? dx dy = By i [rung + GW") 6.) ax dy 
'h h 


+B, Í l [rio + cmg] dax dy + | $¢[FW@) v, + cv] do 


h on, (2.5) 


+x | | £0") <7 wW V6 > ax ay 


with 


1 = 
7 1 B a te sa” Kue 


a, Bie and B3 correspond to an optimal choice of the length of 
the predictor time step. The notation v = aX) holds for the 
outward unit vector normal at dye The symbol £_ denotes a 


mass-lumping operator allowing the computation of pe without 
solving a system. The boundary conditions are included in the 
W integral. The last term of the right hand side is an 
artificial viscosity one. For further details we refer to 
Angrand and Dervieux (Ibid.). 
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2.1.1. A first simplified numerical experiment: Transonic 
flow in a channel with a bump. 


Our goal is to evaluate the behaviour of the scheme applied 
to meshes containing a stiff refinement. Thus we do not 
consider the adaptive procedure but a single refinement of the 
boundary triangles in the following problem, proposed in a GAMM 
Workshop (see Rizzi and Viviand, 1981): the geometry is a 
channel with a 4% thick circular bump; the Mach number at 


entrance is M, = -85. 


The comparison of the pressure outputs will be done with a 
simulation using a rather fine regular mesh of 1512 nodes (Fig. 
3). 
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WYNN ZS ZSZ 
WN ZN ZN ZS 
WYN ZN Z 


) 
; 
i) 
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[7 WV WYVINZS ZS 
CZ SZ SSO WMA ESS 


| 


i) 
| 


( 


I Se Se SOS 
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Fig. 3 Channel with a circular bump, fine mesh reference 
(1512 nodes). 


The initial mesh is very coarse, with only 161 nodes 
(Fig. 4). The refinement is performed twice on the first row 
of triangles lying on the underneath bumped wall. The use of 
this procedure provides a mesh of 411 nodes (Fig. 5). 


coarse mesh (161 nodes) 


Fig. 4 Channel with a circular bump, 
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—— | — 


Fig. 5 Channel with a circular bump, refined mesh (411 nodes) 


We consider now the pressure distribution on the bumped wall 
corresponding to the different grids: on Fig. 6, a sharp shock 
is obtained with the fine mesh by a costly computation, while 
in Fig. 7 the shock is spread also only over two strips but 
seems not sharp enough because of the coarseness of the mesh; 
with the locally refined mesh (Fig. 8), we obtain a sharper 
shock still spread over 2-3 strips of elements. The comparison 
on Fig. 9 with the fine mesh calculation proves that the shock 
location is correct. 


K, 
1.0) 


Fig. 6 Channel with a circular bump, pressure coefficient 
distribution with the fine mesh of 1512 nodes. 
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1.0 


0.0 


Fig. 7 Channel with a circular bump, pressure coefficient 
distribution with the coarse mesh of 161 nodes. 
-K 
1.0) 


0.0 


Fig. 8 Channel with a circular bump, pressure coefficient 
distribution with the refined mesh of 411 nodes. 


Fig. 9 Channel with a circular bump, comparison of the pressure 
coefficient distribution obtained with the fine mesh 


and the refined mesh. 
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Finally the distributions of the entropy deviation are also 
compared on Fig. 10, 11, 12; we get a better representation of 
the entropy level, although it is slightly oscillatory. We can 
derive from this numerical experiment the following conclusions. 


Fig. 10 Channel with a circular bump, entropy deviation 
distribution, with the fine mesh of 1512 nodes. 


0.01 


Fig. 11 Channel with a circular bump, entropy deviation 
distribution, with the coarse mesh of 161 nodes. 


= 


0.01 


Fig. 12 Channel with a circular bump, entropy deviation 
distribution, with the refined mesh of 411 nodes. 
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l. The Finite Element Method behaves satisfactorily when we 
perform a rather stiff refinement. 


2. In particular, the artificial viscosity model has also a 
good behaviour as far as shock profiles are concerned . 
Entropy distributions are also much improved but a sensitive 
tuning of the viscosity terms seems necessary to obtain 
oscillatory-free results. 


2.1.2 A second numerical experiment of industrial interest: 
a supersonic flow around and inside an inlet. 


In the following, we consider the complete adaptive 
algorithm. The problem to be solved is a 2-D flow around and 
inside an inlet. The half-geometry is considered, which 
reduces the domain to a channel ended by the engine (internal 
flow) and connected to the external flow. The Mach number at 
infinity is M, = 2. and the Mach number at the engine (outflow 


boundary)is Mo = 2l 
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Fig. 14 Aircraft inlet, partial view of the final mesh (1925 


nodes) - 
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A double enrichment is performed via the entropy criterion. 
Starting from an initial regular mesh, the self adaptive 
refinement acts in the vicinity of the oblique shock and also 
of the sharp edge to provide local enrichment. Comparison of 
the kK contours show a large improvement of the accuracy; the 


oblique shock is sharply captured, with only 3 strips of 
elements transition (Fig. 15, 16). 


Fig. 15 Aircraft inlet, Ke contours, coarse mesh (1145 nodes) 


Fig. 16 Aircraft inlet, Ko contours, refined mesh (1925 nodes) 
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2.2 The upwind scheme 


The combination of the above Galerkin Euler solver with 
grid refinement gives rather good results. But with this kind 
of centred scheme with added artificial viscosity, non- 
oscillatory entropy distributions are hardly obtained without 
adding extra viscosity. This problem is still more difficult 
when the mesh size varies rather rapidly. 


Therefore we focus now on an upwind scheme where no 
parameter adjustment is needed. 


This scheme, advocated previously by Van Leer (1983), has 
been derived for unstructured triangular meshes by Fezoui 
(1985). 


We start from a Galerkin representation, using the above 


space V, ; this enables us to construct approximate derivatives 
in Yi as follows: 
ow 
Í ae dx dy 
Vi vertex, WU) = 2PC) (2.6) 
ž Í dx dy 
suppli) 


where supp(i) is the support of the basis function related to 
vertex i. The resulting approximate gradient is used in the 
two phases of the time-stepping: 


l. Prediction phase 


a At RAR A ; =, : 
ew Slr orm, + 6 wD WD (2-7) 


2. Correction phase 


miha a E ff (re vv + Gv.) 20 (2.8) 
i i 4 x y 
area(i) ai 


The second phase is a Finite Volume formulation with respect 
to cells i constructed around vertices i with the medians of 
neighbouring triangles; for the numerical integration A 
(indicated by the superscript *) on the cell boundary di, we 
use integration values a located on the mid-segments [iJ] 


with vertex i and neighbouring vertices j as extremities: 
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x a Wl) oem 
Weg =W, + siJa 


i i W (i) 
3 y 


then a plus-minus flux splitting is applied to Wij and W, we 


Ja 
refer to Fezoui (1985) for details. 


Our purpose is to show that smooth (oscillation-free) 
entropy distributions can be obtained using this new scheme 
even in presence of strong local refinements: as a test case 
we consider a transonic flow around a NACAOO12 airfoil with a 
Mach number at infinity equal to M, = .85, without angle of 


attack. We start with an initial mesh containing 600 nodes 
shown on Fig. 17. The first value of the grid refinement 
parameter C is 3% and we obtain an enriched mesh with 1049 
nodes presented on Fig. 18. Then, a second refinement is 
performed with the parameter C taken equal to 7%, but this time, 
the grid refinement has been performed only in the front part 
to avoid the addition of too many points near the shock. By 
that procedure, a final mesh with 1615 nodes is obtained 

(Fig. 19). 
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Fig. 17 NACA 0012 airfoil, blow-up of the coarse mesh (600 
nodes). 
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Fig. 19 NACA 0012 airfoi 
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l, blow-up of the final mesh (1615 nodes) 
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We can make the following comments: on one hand the 
entropy distributions presented on Fig. 20, 21, 22 keep a 
rather smooth aspect at the leading edge, on the other hand, 
numerical entropy reduction is achieved in a monotone manner, 
with a ratio of about 3 for each refinement. 


The final result gives an entropy level of the order of 107 
almost everywhere before the shock. 


It is interesting to compare on Fig. 2l, 22 the entropy 
distribution obtained with only one enrichment (1049 nodes) 
with the distribution obtained with a full uniform splitting 
of the initial mesh which leads to a regular O-mesh with 2280 
nodes; the good agreement before the shock is clear, 
demonstrating the efficiency of the local refinement approach. 


Clearly the final resulting mesh is not the best available 
NACAOO12 mesh; our purpose is to promote local self-adaptive 
refinement as a flexible and efficient method to generate 
physically adapted triangulations from a priori arbitrary 
initial meshes. 


Fig. 20 NACAOQO12 test, entropy plot with the coarse mesh of 
600 nodes. 


Zz 


0.01 


o 


Fig. 21. NACAOO12 test, entropy plot with the medium mesh of 
1049 nodes, 


3 
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“M 


0.01 igor 


Fig. 22 NACAOO12 test, entropy plot with the final mesh of 
1615 nodes. 


zZ 


pele 


0.01 


Fig. 23 NACAOO12 test, entropy plot with the uniformly 
refined mesh of 2280 nodes 


3. CONCLUSION 


In this paper, we have described a self-adaptive mesh 
enrichment algorithm based on: 


- Finite Element type unstructured meshes 


- a physical criterion well adapted to a sensitive accuracy 
problem in Euler aumerical simulations. 


The corresponding enrichment procedure has been combined 
with two representative Euler solvers, to illustrate the 
accurate and efficient possibilities of these schemes when 
performed on severely unstructured meshes: 

An oblique shock has been captured accurately with the 
Galerkin F.E.M. solver. 


We have however pointed out the better behaviour of the 
upwind scheme to obtain smooth entropy distribution; this is 
likely to be due to the fact that the numerical viscosity of 
the upwind scheme is self adapted. 
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The resulting algorithm enables us to generate in a fast 
aerodynamic manner well adapted meshes depending not only on 
the geometry but also on the inputs (M,, angle of attack, 


etc...) in order to solve a new class of problems; it seems 
to have a quite good potential for further 3-D applications of 
industrial interest. 
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AN ADAPTIVE ORTHOGONAL GRID METHOD FOR 2D OR 
AXISYMMETRIC COMPRESSIBLE FLOW PROBLEMS 


Luca Zannetti 
(Dipartimento di Ingegneria Aeronautica e Spaziale, 
Politecnico di Torino, Italy) 


l. INTRODUCTION 


A finite difference time-dependent method has been proposed 
recently to solve inverse problems in ducts for compressible, 
inviscid, rotational flow fields (see Zannetti (1984)). 


The method is extended in the present paper to deal with 
more general problems, including external flow direct problems. 
The method belongs to the category of adaptive grid techniques. 
The grid is body fitted and orthogonal and its metric factors 
depend on the flow properties. Once the steady solution is 
reached, the grid is formed by a set of streamlines and lines 
orthogonal to the streamlines. 


The numerical computation is based on the lambda- 


formulation (see Moretti (1979), Zannetti et al. (1981), 
Moretti and Zannetti (1984)). The simple form of the boundary 
id allow the 


conditions and the orthogonality of the gr: 
qualities of the lambda-formulation to be fully exploited, 


particularly as regards consistency of the computation at the 
boundaries and accuracy in transonic flow fields. 


2, MATHEMATICAL MODEL 


£ the envelopes of the paths of 


Denote by s the arc length o: 
Š fluid during a transient and by 


the particles of a compressible 
n the arc length of the lines orthogonal to the envelopes. 


Define a pair of curvilinear coordinates along the envelopes 
and their orthogonal trajectories ¢ and y , related to s andn 
by 


= (2.1) 
ds = hj d¢ , dn hody 


where hy and hy are metric factors. 
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Taking t and x as independent variables, the equations of 
motion are 
Be “y 
E a = «S.) + 6a ( +q) =0 
at «Se F hy (ay 3 p hy hy 


2 a Y a A 
aw, +a E +g - F8y = 9 (2.2) 


where a is the speed of sound, q the modulus of the flow 
velocity, v is the flow angle, S is the entropy, 
t yeu ne} 
ô = 7 and K 6 Fe 
and all flow properties are normalised to suitable reference 
values. 


Since we are not interested in a physical description of an 
actual transient, but are looking for the steady solution of 


(2.2), we assume hy and hy to be defined as the scale factors 


relative to the steady state. By so doing, we alter the 
physical meaning of equations (2.2) only so far as the time 
derivatives are different from zero, and the method we use 
reduces to a "pseudo" time-dependent method. When the flow is 
steady the particle paths coincide with the streamlines: thus 
it is natural to select the variable y as playing the role of 
stream function. Therefore the metric factor hy is given by 


pate 
hy = 5g AWD (2.3) 


where A(y,t) is the derivative of an arbitrary stretching 
function. Similarly if the flow is irrotational, is is natural 
to select the variable $ as playing the role of velocity 
potential: therefore it is 


HE 
h sg B (ot) (2.4) 
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If the flow is rotational, the variable ġ loses such a 
physical meaning and the metric factor hy has to depend on 


the vorticity. By definition, in this set of orthogonal 
curvilinear coordinates, the vorticity is given by 


Pens Ahir Hi (2.5) 
i) hih, 

By integrating (2.5), the metric factor h can be expressed as: 
is a hy 

exp e— |S avira, (2.6) 


q 
h, =B(¢$) Ye 


q 
where the integral has to be computed along y = constant lines 
and the vorticity w is evaluated by means of the relationship 
TS, -E 
ee ee yo, (2.7) 
hy q 


where H is the total enthalpy. Equation (7) is again valid 
only at the steady state, in agreement with the general idea 
of disregarding the physical meaning of the transient and 
focusing interest on the steady solution. 


We have already decided to sacrifice the physical 
description of the transient in favour of getting the steady 
solution on a grid that (i) is orthogonal, (ii) does not have 
to be defined a priori, (iii) adapts itself to the flow. In 
the same spirit we can simplify equations (2.2). In fact, when 
the flow is steady, the entropy is piecewise constant along the 
streamlines, holding a value that has to be prescribed at the 
inflow boundary and jumping across shock waves if shock waves 
occur. Assuming that we are able to evaluate the entropy jumps 
across shocks, as explained in (Zannetti, 1984) the last of (2.2) 
can be neglected, the derivatives S, can be eliminated as well, 


and the derivatives S, can be regarded as known terms. With 


such assumptions the equations actually used for the computation 


are q, N 
2 E$: ede 
al + a, + da(— +a) = 0 
t h è h hy 
q a 
PE, Dp EAEL Yee 2.8 
+q + =0 (2.8) 
qe h h 6 
v a 
A ESE SA 5 
EA E EEE SEE E 
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3. NUMERICAL TECHNIQUE 


The same hyperbolic nature is shared by (2.8) and the 
original Euler equations. The same boundary conditions apply 
to both sets, as well. The numerical scheme we use is the 
lambda-scheme, as formulated in (Moretti and Zannetti, 1984). 


Following this paper we rewrite (2.8) as follows: 
6 
a. = 7f tf, + f, + £,) 


1 
a= 3 (fy > £) (3.1) 


where 


(3.2) 


and 


a (3.3) 


mh 
i 
I 

~ 


mh 
uw 
I 
> 
Ht 
Q 
< 
[i 


ADAPTIVE GRIDS FOR COMPRESSIBLE FLOW 393 


The form (3.1) of the equations of motion is the result 
of three linear combinations of four compatibility equations on 
four characteristic surfaces. Each compatibility equation 
contains derivatives taken along two lines on a characteristic 
surface, viz, the bicharacteristic, along which signals 
propagate, and another line. The linear combination above 
eliminates all the terms not related to bicharacteristics. 
Therefore the Ay express the components of the velocities of 


propagation along the four selected bicharacteristics and the 
fj terms express the convection of signals along the same 


bicharacteristics. The ġ derivatives and the Ņ derivatives are 
approximated by "upwind" differences, according to the direction 
of propagation of signals, providing consistency with the domain 
of dependence of each grid point. 


4. THE FORMULATION OF THE PROBLEM 


In Figs. la and lb the channel and the computational grid 
are sketched in both the ¢, p plane and the physical plane. 
The line AC is the inflow boundary, the line BD is the outflow 
boundary, and the lines AB and CD are the walls of the channel. 
At the inflow boundary all the flow properties are prescribed 
if the flow is supersonic, while the flow angle v , the total 
temperature T and the total pressure Pa are prescribed if the 


flow is subsonic. At the outflow boundary no boundary 
conditions are required if the flow is supersonic, whereas the 
static pressure is given if the flow is subsonic. 


Fig. lb. 


Fig. la. 
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When solving inverse problems in channels we look for the 
shape that the walls have to assume to satisfy the pressure 
distribution prescribed as boundary condition along them. The 
boundary conditions along the boundaries can be prescribed as 
function of the curvilinear coordinates ¢ and y: as a 
consequence the functions A(y,t) and B($,t) of (2.3) and (2.6) 
can be assumed as constant, A(w,t) = B(ġ,t) = 1. The length 
and width of the channel are unknown a priori and are part of 
the solution, as is normal for an inverse problem. Once the 
steady solution is obtained, the shape of the walls, as well 
as the shape of any ¢ = constant or wy = constant grid line, 
can be computed by numerical integration. The » = constant 
grid lines are computed through 

4 T 
Z=2, + $ Bap 


MA h äy (4.1) 
in 
and the y = constant lines through 
> P 
iv. 
= 2 
z Zin +,f e hj do (4.2) 
in 


where z = (x+iy) is the complex coordinate on the physical 
plane, 


When solving direct problems the geometry is known while the 

flow properties are unknown. It is convenient to prescribe 

the boundary conditions as function of the arc length s along 
the walls and the arc length n along the permeable boundaries, 
It follows that the values of the functions A(,t) and B($,t) 
can no longer be assumed a priori if we require that the 
computational points at the boundaries hold a fixed location 
on the physical plane during the transient. To this end we 


have to enforce the condition that the metric factors hy and hy 


be constant in time along the boundaries. To fulfil this 
requirement the functions A(y,t) and B(¢,t) must be computed 
and adapted to the flow properties along the boundaries during 
the transient, in order to assure the consistency of the metric 
factors with the geometry of the boundaries. Unfortunately, as 
is evident, this goal can be achieved at only one of the two 
opposite boundaries in an internal flow. 


The method described here is particularly suited to solve 
inverse problems, or to solve problems which are direct only 
for one solid wall, as in the case of mixed-direct-inverse 
problems where, for instance, the shape of one wall of a channel 
is prescribed while the other wall has to be designed in order 
to satisfy a prescribed pressure distribution along it, or for 
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solving full direct problems with only one solid wall, as in 
external flows or in internal flows with a symmetry line. 


5. TREATMENT OF BOUNDARIES 


Since the grid is orthogonal, we can use a procedure at 
the boundaries which is both simple and consistent with the 
hyperbolic nature of the governing equations. The boundaries 
are computed following the ideas expressed in (Zannetti, 1981). 


A certain number of the f j terms cannot be correctly 


approximated by finite difference at the boundaries since they 
express signals propagating towards the interior of the flow 
field. Such terms are considered as unknown in (3.1) and have 
to be evaluated using the boundary conditions. 


At the inlet boundary, if the flow is subsonic, the terms 


fy and £3 must be assumed as unknowns that have to be 


determined satisfying the boundary conditions. Here the flow 
angle v and the total temperature are prescribed as a function 
of y and are constant in time. By considering their time 
derivatives, the following equations have to be satisfied: 


fa, + aq, =0 (5.1) 


Ve =o. 
The two unknown terms can be determined by substituting into 
(3.1), that is: 
af) + alfi + £4 + fs) 
f, = ———— 


2 at 
{ (5.2) 
- Lig? tees 
3 2 
ilar 


e bound: ints on the walls are treated in a sim: 
AN when S inverse problems, the static pressure 
distribution is prescribed along the walls as independent of 
time. The entropy being assumed constant in time along the 
walls, the speed of sound is constant in time, that is 


a ONE 
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At the lower boundary, for instance the unknown term is f.: 


by enforcing the boundary condition on the governing equations, 
the unknown term is computed using 


fs Sista + fz E £,) . 


When solving direct problems, the flow angle v is given 
along the solid walls. Therefore, at the lower wall the 
unknown term fs is computed using 


E T A y 
The outflow boundary is computed in the same way. If the 
flow is subsonic the static pressure is prescribed as the 
boundary condition. The unknown term £ is then given by 


(Ce A 


6. EXTENSION TO AXISYMMETRIC SWIRLING FLOW 


The present computational technique can be extended directly 
to axisymmetric swirling flows. Keeping the same notation 
used so far, let us denote by v the tangential component of 
the flow velocity in an axisymmetric motion. The governing 
equations, with the simplifications introduced in (2.8) and 
(3.1) can be written; 


Nee) g sinv 
a, = 7(f, + f, + £, + £5) - ôa y 
l v? 
a= z2 = f) + Pan sinv 
1 WA 
VA ii a Ba 
t a23 + £, -£,) + Ta cosv 
= ae 
Ve fs = sinv 
with 
fs = -A3 vý 
The metric factors hy and h, are expressed by (2.7) and (2.4) 


respectively, as in the two-dimensional flow. In (2.7) w 
now means the tangential component of the vorticity. For its 
computation (4.1) has to be replaced by 
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2 TS -E + wy, 
fe ey a aA y 
yq h, q 


The computation can be carried on in the same way as in 
the two-dimensional case, except that during the integration 
in time the physical radial coordinate y has to be updated by 
taking the imaginary part of (4.2) or (5.1) at each 
computational point. 


7. NUMERICAL EXAMPLES 


Four numerical examples of the proposed method are 
described here. The first two refer to full inverse 
computations, the third to a mixed direct-inverse computation, 
the fourth to a full direct computation of a duct with a 


symmetry line. 


The Ringleb flow has been selected as the test case. Two 
streamlines of the Ringleb flow, in the transonic region, 
have been chosen as the walls of a channel whose shape has to 
be determined by the present method. The original 
irrotational Ringleb flow has been modified to test the method 
when dealing with rotational flow fields. To this end a non- 
uniform total temperature is imposed on the flow field, while 
the total temperature is kept uniform. The resulting flow 
field is still analytically determined, the shape of the 
stream lines, the pressure and Mach number fields are un- 
changed, remaining the same as in the original irrotational 
flow field, while flow velocity and temperature are different. 


The channel bounded by the streamlines > = .85 and y = 1.1 
of the Ringleb flow has been selected. It has been made 
rotational by imposing a total temperature distribution with 
Tó =1 at p = .85 and T5 =1.2at y= l.l, The theoretical 


pressure acting along the walls has been taken as the design 
input. Figs. 2 and 3 show the theoretical channel, with the 
iso-velocity and iso-Mach patterns, respectively. 


Fig. 2. 
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Fig. 


Figs. 4 and 5 show the computed shape, grid, iso-velocity and 


iso-Mach patterns. 


Fig. 4. 


Fig. 5. 
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The second example refers to the design of a two- 
dimensional transonic channel. The pressure is prescribed 
along the walls to make the flow subsonic at the inlet and 
at the exit, with a supersonic transition in the middle. The 
computed channel is shown in Fig. 6, where the grid and the 
iso-Mach lines are drawn. 


Lhe TTo 
ZAM 
LAAN 


Fig. 6. 


Since the pressure is prescribed as a smooth function of > , 
the flow has to be shockless at the walls, but in general we 
cannot expect the whole flow field to be shockless. In fact, 
as is evident from the iso-Mach lines of Fig. 6, an imbedded 
shock, due to the coalescence of compression waves, appears in 
the core where the flow goes from supersonic to subsonic. 
Fig. 7. shows the total pressure field. It undergoes very 
little variation over the flow field, except on the region 
downstream of the shock. The minimum value of total pressure 
is reached at the centreline, where the shock is stronger. 
The total pressure increases from the centreline out along 
the shock and practically reaches the upstream unperturbed 
value where the shock vanishes. 


Fig. 7. 
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The third example is a two-dimensional mixed direct- 
inverse problem in a channel. The problem is direct for the 
lower boundary, where the shape of the wall is known and the 
pressure on it is unknown, and it is inverse for the upper 
wall whose shape is to be determined in order to satisfy a 
constant pressure distribution. As stated above when 
discussing the formulation of the problem, the locations of 
the nodes on the lower wall are kept fixed on the physical 
plane by taking the metric factor hy constant in time along 


the lower boundary during the computation. To this end, the 
function B(p,t) is computed at the lower boundary, at each 
computational step, by 


* 
B(o,t) = ha 


* 
where hy is the value assumed by the metric factor at the 
starting condition, at the time t=O. The inlet boundary has 
been treated in a similar way to assure that its length 
remains constant in time. 


The computed channel is shown in Fig. 8 together with 
the grid and the iso-Mach lines. The shape of the lower wall, 
where the direct problem is solved, has been chosen to enhance 
the capability of the method to deal with awkward geometries. 


Fig. 8. 


The last example refers to a full direct problem with a 
symmetry line. The computation was carried on as in the 
previous example except for the upper boundary which is now 
a symmetry line, treated as a solid Straight wall. The nodes 
have a fixed location on the physical plane at the lower 
boundary during the computation, as explained above, while 
they move along the upper boundary in so far as the flow is not 
steady and the flow properties change in time, although the 
boundary conditions at this boundary do not change in time, 
being the slope Of the wall constant. The result is shown in 
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Fig. 9. It shows a two-dimensional transonic nozzle with the 
grid and the iso-Mach lines. 


AA 
445) 


Fig. 9. 
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ARTIFICIAL VISCOSITY Q ERRORS FOR STRONG SHOCKS 
AND MORE ACCURATE SHOCK-FOLLOWING METHODS 


W.F. Noh 
(Lawrence Livermore National Laboratory, Livermore) 


1. INTRODUCTION 


The artificial viscosity (Q) method of von Neumann-Richtmyer 
(1950) has been (and is) a tremendously useful numerical 
technique for following shocks wherever and whenever they 
appear in the flow. As we shall see, it must be used with 
some caution, as serious Q-induced errors can occur in some 
strong shock calculations. 


We investigate three types of Q errors: 


1. Excess Q heating, of which there are two types: 
(i) excess Wall Heating on shock formation and 


(ii) Shockless Q Heating; 
2. Q errors when shocks are propagated over a non-uniform 


mesh; and 
3. Q errors in propagating shocks in spherical geometry. 
We use as a basis of comparison, the Lagrangian formulation 
` eR ar) 2 
in (Colella and Woodward, 1982) with Q = Cope (a) , and 


refer to this as our standard calculation. In Section 2, the 
Langrangian differential equations with Q [in plane (ô =1,), 
cylindrical, (6 = 2), and spherical (6 = 3) geometry] are given, 


given 


272 
and we include an artificial heat flux term H = hope Jule used 


in Noh's (Q&H) shock-following method (see Noh, 1983). For our 
three Q's are defined: Qe Q0), and (e (where Q; 


is our standard Q, above, and Q, v) is the original definition 


given in (Colella and Woodward, 1982) (see also Richtmyer and 
Morton, 1967, p. 319) and depends on the geometry 6 = 1,''2,°0r 


3). Also two H's, H, and Ha are defined. Here, L refers to 


comparisons, 
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the normal Lagrange usage, in which the length 2 is taken to be 
equal to the Lagrange mesh interval Ax in the difference 
formulations, and shocks are spread over a fixed number of mesh 
intervals (=3) regardless of their actual size. E refers to 
the Eulerian (or "fixed length") definitions in which 2 is a 
constant (=bx oy)! and shocks are spread over a fixed physical 


distance (=3Ax, 8) + Then, in the (Q&H) method, on and H, are 
used together, as are Qz and Hoe Standard (staggered mesh) 
differencing is used (see Noh, 1985), and the nominal benchmark 


2 
Lagrangian Q, = 2p/ (du). 


In Section 3, the Wall Heating Q error is investigated in 
test problem #1 (Figs. 8a,b). This is an infinite-strength, 
constant-velocity shock that is generated when a zero- 
temperature (y = 5/3) gas is brought to rest at a rigid wall 
(x = 0). The excess Wall Heating error occurs in the first 
few zones near the wall (Fig. 1). Now, in nature, heat 
conduction prevents excess Wall Heating from developing, and 
thus the (Q&H) method (which includes an artificial heat flux 
term, H) also prevents the excess Wall Heating error. This is 
shown in Fig. 2. 


The Shockless Q Heating error is investigated using a test 


problem called "Uniform Collapse" (see Noh, 1976). Here the 
fluid is shockless, although it is everywhere compressing, and, 
consequently, an energy error Ac is introduced for Qe Qgr and 


QW). In particular, the Qg) error is much larger 

(for ô = 3) than for o (i.e., it is shown that be, (v) = 87be,) 
and this fact strongly favours our use of Q; as the standard 
Lagrange formulation. 


In Section 4, the second type of Q error is investigated by 
introducing a non-uniform mesh (Ax = RAX, for constant R) into 


problem #l (see Fig. 3). With Q = 2p (4u) 2 as our standard Q, 


the Q errors for R = 1.05 and R = 1.25 (Figs. 4 and 5) are 
compared. The errors are seen to approach 100% for R = 1.25, 
and thus can be a serious concern. A calculation using Qn (the 


fixed-length Q), taken together with A, (in the (Q, & Ho) 


method), eliminates poth type #1 and #2 errors (Fig. 6). 
Unfortunately, using 2. Spreads shocks over a fixed physical 
distance of = JAX ax’ Which is unacceptable in those regions 


where a smaller mesh interval occurs. A theoretical 
explanation for the non-uniform mesh error (see also Noh, 1985) 
Shows that letting 2 = Ax in the difference formulation implies 
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& = &(x) in the differential equation formulation of Que and it 


is this use of £ that generates this non-uniform mesh error. 
The (Q, 8H, ) method permits the use of a considerably smaller 


Q, coefficient, Cor and thus produces sharper shocks. This, 


in turn, produces smaller Q errors, as seen in Fig. 7. The 
(Q&H) method thus offers an acceptable procedure if the unequal 
zoning is not too severe. 


In Section 5, the spherical geometry Q errors ar 
A e in 
using Noh's spherical-shock test problem (see Noh, Sopy Weer when 


Fig. 8d). This type #3 error is considerably more complicated 
than the previous Q errors, in that it depends both on the Q 
formulation [e.g., Q, vs Q, (v)] and whether or not Q is 


treated as a scalar viscosity as in (Von Neuman and Richtmyer, 
1950) or as a tensor viscosity, as in (Shultz, 1964). The 
errors are seen to be truly enormous (Fig. 9), where the error 
for the standard Q; formulation is nearly 600% (near the 


origin) and is nearly 1000% for the original Q,,(v) definition 


of Q given in (Von Neumann and Richtmyer, 1950). In Fig. lo, 
we see how serious this error can be as a function of mesh size 
and just how slow the solution converges to the exact value, 


eal = 64, Indeed, even for K = 800, there is still a 
considerable error near the origin. The explanation is given 
in Fig. 11, where it is shown that the error results from the 
finite shock thickness, and thus the inability of the Q method 
to select the correct preshock density. That is, the shock 


spreading picks p < EEEN = 16. We conclude that sharper 


shocks give smaller Q errors and indeed this is shown in 

Fig. 12, when the (non-Q) PPM method of Colella and Woodward 
(1982) produces very sharp shocks and has minimal error. This 
error in PPM is further reduced by using an adaptive mesh 
technique to capture the shocks, and the result using 400 
zones (with an adaptive mesh) is equivalent to a normal 1200- 
zone (essentially converged) PPM problem. 


In Section 6, Schultz's tensor Q formulation (T) of the 
hydrodynamic equations is given, along with his tensor Q (5) 
definition (see Schultz, 1964). Calculations using this tensor 
formulation (Fig. 13) are a significant improvement over the 
standard scalar (S) solutions (e.g-, Fig. 9). However, there 
is only a slight improvement using Schultz's Q, (8) over the 
standard Q, Consequently, we conclude that the tensor 


equations (T) are more important than which Q, formulation is 
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used. Again, as sharper shocks reduce the type =3 error, 
nearly exact results are obtained using a very small 


2 
cå = %)in the (Q H,) (T) method, where Q, = (¥)p (Au) 
and H, = 10p|Au|Ae (Fig. 14). In Fig. 15, the various Qs, 


(Q&H), and the (non-Q) PPM method are compared. The best 
results are obtained from the tensor formulation using the 


(Q, aH) (T) method and PPM with an adaptive mesh. 


2. LAGRANGIAN FLUID EQUATIONS WITH ARTIFICIAL VISCOSITY (Q) 
AND HEAT FLUX (H) 


von Neumann and Richtymer (1950) introduced their artificial 
viscosity Q as a scalar quantity, and we take their formulation 
of the Lagrangian fluid equations as our standard. Also, the 
new (Q&H) shock-following method of Noh (1983) (which uses an 
artificial heat flux H in addition to the artificial viscosity 
Q to follow shocks) is included in the formulation. 


2.1 Differential Equations 


The independent Lagrange variables are r and t, where r is 
the initial position of the Eulerian (physical) coordinate 
(i.e., R(r,O) = r), and u, fp, €, and P are the velocity, 
density, internal energy, and pressure. The differential 
equations for plane (ô = 1), cylindrical (ô = 2), and spherical 


(6 = 3) geometries are then (letting dm = Sper tar = pose) : 


6-1 
w= -ôR (P+O) n momentum ' 
Ke =u 
6 
v =l/p = (R li mass ’ (2.1) 
Er 6-1, 
Ep = (PH) vy, + 6(R B) a energy , 
P = P(p,e) equation of state . 


2.2 Definitions of Q and H 


In Q and H, we include linear terms (see Noh, 1976, 1983). 
These are used in some of the Q error comparisons to produce 
smoother shock profiles, but otherwise don't affect the Q 
errors, Also, the Qs and Hs are set to zero if the indicated 
tests fail to hold. 
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Standard Lagrange Q: 
bhi uz <0 


2A leony 2 2 
Q, (CoC) = Coet (u) -C PC tu, 
(2.2) 


Standard Lagrange H: H. (h? h) = cae 
of OOF phob = hpt lu le, + hyec.te,, 
(2.3) 
Eulerian fixed-length Q: Q (c2,c )= cpt ( yc PC e 
fu, < Oe ESO! 1 OMAR als. 8 oR 
(2.4) 
A 2 re LP) 
Eulerian fixed-len H: nry 
gth Hp (borb) = hope lu,le,+ hy PCER 
(2.5) 
Original Lagrange formulation: Q, (v) = E A 
(2.6) 


t£ pn < O (see Von Neumann and Richtmyer, 1950), 


where Cor Cyr hy and hy are constants, % is a constant with the 


dimensions of length, and Cs is the local sound speed. The 
usage is (Q, 8H, ) and (Q,,8H,,) + Equation (2.6) is the original 


Q formulation (see also Richtmyer and Morton (1967), p. 319) in 
terms of v = (1/p), and we note it is the only Q here to depend 
on the geometry (ô). In the Lagrange formulation (L), the 
standard use is to take % = Ar in the difference equations. 
This spreads shocks over a fixed number (#3) of mesh intervals 
(regardless of their size), while in the Eulerian formulation 
(E), & is constant (Ax nax)” and shocks are spread over a fixed 


physical length (=32). Standard "staggered mesh" difference 
equations are used (see Noh (1985) for details), and the 2 
nominal Q benchmark difference formulation is Q; = 2p (du) ^. 


3. EXCESS Q HEATING 


There are two excess Q heating errors: (1) excess wall (or 
piston) heating due to Q, which occurs on shock formation (e.g-, 
at a rigid wall where a gas is brought to rest, and a shock is 
propagated away, or for the sudden startup of a piston); and 


(2) Q heating for shockless compressions (e.g-, when u <0, 


but no shock is present). 
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3.1 The Wall Heating Error Test Problem 

Test problem #1 is that of a constant-state, constant- 
velocity shock of infinite strength (i.e., the pre-shock 
pressure P = pe = 0). The shock is generated in a perfect 


(y = 5/3) gas by bringing the cold (9 = 0) gas to rest at a 
rigid wall at x = O. This is just the familiar constant- 
velocity (piston) shock, but in a frame of reference where the 
piston (here a rigid wall) is at rest (Figs. 8a, b). In Fig. l, 


+ 
p is plotted for our "standard calculation" using Q = 2p e 


The shaded area (the Wall Heating error) occurs typically in 
the first three zones next to the wall (or piston). That this 
error is unavoidable for any shock smearing method, is argued 
in (Noh, 1985). 
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Fig. 1 Qi = 2p(Au)”. Shaded area is "Wall Heating Error", 


Typically it is over three mesh intervals. 


In real fluids, heat conduction is present, and Wall Heating 
does not occur, and this is the basis of Noh's (Q&H) method, 
Eqs. (2.1) and (2.3). In Fig. 2, Q, (0-67, 0.2) is used, plus 


the heat flux term H, (073/4) = 3/4pC_Ac. As expected, the Wall 
Heating error is zero. We also note that the (Q, &H_) solution 
j 3 L L 

is considerably smoother, and this permits the use of much 


smaller Q constants Co and cy (with generally smaller Q errors). 
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Fig. 2 The (Q&H) method, where Q'S H = $ R + 
1 3 3 
5 pc lâu] and H, (O) = TESES The heat flux 
term, H, ređuces the Wall Heating error to zero. 


3.2 Shockless Q Heating Errors 


ssion wave exists (i.e., 


This is the situation where a compre: 
lution is shockless. 


u,„<0) , and thus Q + O; yet the exact so 
we consider the useful "Uniform Collapse 
Problem" (see Noh (1976), p. 60), in which a flow is everywhere 
undergoing a compression, but no shock develops. We consider 

a unit "sphere" (O< r <í 1), (for ô = 1, 2, or 3), and to 
simplify the analysis of the energy errors due to Q, we assume 
pressure to be a function of density only, P = P(p)- 


For this analysis, 


° o 

The initial values are ulr,0) = -r, p =P 1 & = €, and 

p° = P(p°) , with boundary conditions u(O,t) = O and u(l,t) = -l. 
The exact solution is that the fluid simply coasts with its 
initial velocity w = -r) until all points uniformly collapse 
onto the origin R = O, at time t = 1. It is easy to verify 
that the exact solution is given by 

6 o 
ulz,t) E ROE and vs (1/p) = (1-t)" (1/p ). (3.1) 


Thus, p = p(t), P = p(p) = P(t), and € = e(t) (since 
6-1 
). 


o 
Ep7 eig 6/o P(t) (l-t) 
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It follows that Q = Q(t), and thus the solution of Eq. (3.1) 
continues to hold except for the energy equation. If we let 
the error in energy be Ae = fle, + Pv) dt = - Tov at, then, 


from Eq. (3.1) and t = (l-t), 


2 
=- are 3.2 
de, SQV, at CACOS] 2n (T) + ( ) 
T 2_-2 
n = 3.3 
de, SQ- Vat 78 (Coe) Ti s and ( ) 
Ae, (v) = -JQ (v)v dt = -6° (c 2) 7an(t). (3.4) 
L L E fo} 
We note that 
2 
Ae, (v) =6 de, and de, < Aen: (3.5) 


Now, as T > O, the error Ac + œ, and this Shockless Q Heating 
error can indeed be serious. From Eq. (3.5) and (ô = 3), we 
see that the error Ae, (v) is nearly an order of magnitude 


greater than the error ae, It is due to arguments similar to 
these that Noh in 1956 suggested that the oF of Eq. (2.2) be 


taken as the standard Q formulation for all geometries 6 = 1, 2 
or 3. We make the suggestion again (and form more 
reinforcement, see Figs. 9 and 10), since Q (v) still seems to 


be in common use. 


Fig. 3 The unequal-zoned, infinite-shock test problem. Initial 
and boundary conditions are the same as test problem #1 
(see Fig. 8b). Here, the mesh interval decreases for 


the first half of the mesh (R < 1), then increases for 
the second half (R > 1). 
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4. Q ERRORS FOR A NON-UNIFORM MESH 


The second type of Q error occurs when shocks are 
propagated over a mesh with unequal intervals. In problem #L, 


let 


bx = RAK (4.1) 


where R is a constant: 1 $ R $ 2. We investigate the cases 
R = 1.05 and R = 1.25. In order to show the errors for both 
decreasing (R < 1) and increasing (R > 1) mesh intervals, we 
let the mesh decrease for the first half of the problem, then 
increase for the second half (Fig. 4). 


Type #1 Error 


2 6 10 14 18 
x 


R=1,05 
Fig. 4 Standard Q, (240) = 2p (Au)? and R=1.25. The type #2 
error (Ap =p - 4) is positive for the first half, 
where the mesh interval decreases (R < 1), and Ap > O 
for the second half (R> 1). The type #1 Wall Heating 
error is present in the first few zones next to the 
rigid wall on the left. 


In Fig. 5, R = 1.05, and the density is plotted for our 


standard Q, = 2p Uui: The total error is shaded, and again we 


see the familiar Wall Heating error in the first several zones. 
E = př- 4, is > O for the first 

The new error, Ap, Fiera Pexact Pr f : 

half (decreasing mesh), and Ae, < o for the second half. This 


new (type #2) non-uniform mesh error grows with R and becomes 
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very serious (=100%) for R = 1.25. This is unfortunate, since 
it is not uncommon to use R = 2 in practice, and thus R = 1.25 
might well be considered a modest zoning change. 


7.6F RTA ae 
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AX’), = 86.7 
R= 1.25 


Fig. 5 Standard or (2,0) = 29 (Au), and R = 1.25. Here the 


error is very serious (=100%). 


The good news is shown in Fig. 6, where R = 1.05, and the 
Eulerian "fixed length" Qe is seen to eliminate the non- 
uniform mesh error. When Qr is used in conjunction with HE in 
the (Q, 8H) method, then both the type #1 (Wall Heating) error 


and the type #2 (non-uniform mesh) error are completely 
eliminated. The bad news is that very large Qe constants are 


2 2 
necessary [ where (Coe) = (C Atna? = 6 (for fy = 1) and 


Cx ax = 0.8 (for Co = 0.33)], and this is seen to spread the 
shock over a large number of smaller zones. In this regard, 


the use of Qg is not satisfactory. 
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Fig. 6 R = 1.05, and we compare the fixed-length Qg with the 
(Q,8H,) method. 


Au, 2 A P ee 
(1) Qp (60-8) = 60 (ay) + o-se li |; this eliminates 


the non-uniform mesh error. 

Ac ` SNF 
(2) Qp (670-8) & Hp (0,6) = 6pc x” which eliminates 
both the type #l Wall Heating error and the non-uniform 


mesh type #2 error. However, there is too much shock 
spreading over the finely zoned region using or for 


this to be a practical way to minimize these errors. 


(3) Exact solution. 


A more practical solution is to use the (Q, 8H, ) method. 


This eliminates the Wall Heating error, and (as mentioned 
earlier), the use of the heat flux H permits the use of a much 
smaller O constant Co (and c) , which, in turn, reduces the 
type #2 (non-uniform) mesh error. in Fig. 7;.R/= 1.05 and 


Q, = ACRE (i.e., ey = 1) is compared with the (Q, 8H, ) 


method, with a corresponding reduction in the non- 
zoning error to around 3%. 


uniform 
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R= 1.05 


Fig. 7 R = 1.05, and we compare Q; (with reduced coefficient 


Co = 1) with the (Q, 8H.) method. 
(1) Q, (1,0) = p (4u) 2. The solution is noisy, but 


type #l and #2 errors are reduced. 


2 
(2) Q, (1,0) = p(4w" & 8 (0, $) =F pc Ae. using 
both Q and H (with small Q coefficients) eliminates 


the Wall Heating type #1 error altogether and reduces 
the non-uniform mesh type #2 errror to ~ 3%. The 
(Q, aH) method is much smoother than Q; alone and may 


be a practical compromise for mesh-interval changes 
that are not too large. 


4.l Theoretical Discussion 


In the standard Lagrange difference formulation of Que the 


length 2 in Eq. (2.2) is taken to be 2 = Ax, and thus Qu) > 


(Au). 


Now, when an unequal mesh interval is used, Ax is no 


longer a constant, and this implies that £ = 2(x) in the 
differential formulation of Qi in Eq. (2.2). In fact, where 


Ax, is given by Eq. (4.1), 2(x) is found to be 


&(x) = 2[(R-1)x + Ax] /(R+1) ` (4.2) 
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See Noh (1985) for details and further explanation of the 
non-uniform mesh error. 


One consequence of using Eq. (4.2) in Eq. (2.2) in the 
differential equations, Eq. (2.1), is that steady travelling 
shocks are no longer solutions. This is clear since the shock 
width (instead of being a constant) will now be proportional 
to (x). This would still be an acceptable numerical 
approximation for shocks if only the proper shock jump 
conditions held, but our numerical experiments show that this 
is unfortunately not the case. 


5. Q ERRORS IN SPHERICAL (ô = 3) GEOMETRY 


The third type of Q error is related to strong shock 
propagation in spherical (or cylindrical) geometries. This 
error is considerably more serious (up to 1000% error in 
(excess) shock heating near the origin), and is also more 
complicated than the previous Q errors. This third type of . 
Q error depends on the Q formulation lise., Q, of Eq. (2.2) 


vs. Q, (v) of Eq. (2.6)] and also seems to depend on whether 


Q is treated as a scalar or a tensor viscosity in the 
formulation of the hydrodynamic equations. In particular, a 
tensor formulation (Section 6) due to Schultx (1964) gives 
sharper shocks than our standard use of Q in Eq. (2.1), and 


this is instrumental in reducing this third type of Q error. 
Test problem #3 is just the spherical (ô = 3) generalization 
of test problem #1 where the post (infinite) shock solutions 


(a, oo ee and pt) are, again, constant step-value functions 
(Fig. 8d). 


le) 


S=1/⁄3 t= 0.6 
(b) 


ed ee) 


wel 


12 $=1/3 t= 06 


(G) 
og 
l peli +t/R) 


(d) 


a P= =0 
phi p=(1+t/R)? 


p= (+uR? 


o1 02 0.3 04 


Fig. 8 The exact solution at t = 0.6 to Noh's (1983) Generic 
Constant-Velocity Shock problems: (a) initial conditions; 
(b) plane geometry (ô = 1) with a shock generated at a 
rigid wall; (c) a shock reflection from the axis of 
symmetry for a cylinder (6 = 2); and (d) shock 
reflection from the centre of a sphere (6 = 3). All 
solutions have constant post-shock states, and all have 
the same constant shock speed, S = 4 (for initial 
conditions (a) and y = 5/3). The essential difference 
is the preshocked density pf = 1 for ô; p 4 for 


6 = 2; f =16 for 6 = 3; and p = p°t1+t/r) $7} in front of 
the shock. 
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Our standard test problem has l100msh intervals (Ar = 0.01), 
and the results are compared at time t = 0.6. Since the shock 
speed is S = 1/3, then 80% of the mesh (i.e., 80 mesh points) 
have been traversed by the shock, and one would expect 
accurate results. Unfortunately, this is not the case, as is 


seen in Fig. 9, where the standard Q (2) = 2p (Au)? is compared 


R 


with the original Q, (V) = 2(dr) Et Gy)? = 2pl Au + 2U4B) 2 


70 


60 Exact solution 


50 


20 


10 


K 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 
R 


Fig. 9 (1) Standard Q = H (u) 


(2) Q, (v) = 2(Ar) ? oz)’ Oy? [the original von 


Neumann-Richtmyer Q Rl aor Eq. (2.6)]. Here, 
Q; is superior to Q (V), but pet Qs are in serious 


error. The correct solution is ae = 64. 
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ep 
Here, both are compared with the exact solution, p = 64. The 
numerical results are strikingly poor, and in fact, hardly 

bear any resemblance to the exact solution. The error for the 


standard Q; = 2p (Au)? is seen to be on the order of 600% near 


the origin and 20% behind the shock, while the error for the 
original von Neumann-Richtmyer Q [here g0 of Eq. (2.6)] is 


roughly 1000% in the central region and nearly 40% behind the 
shock. Clearly this third type of Q error depends on the Q 
formulation. The solution using Q, (V) is seen to be 


definitely inferior to the solution using Qg: There are 


several reasons for this. One is related to the Shockless Q 
Heating Error of Section 3. Here, the first zone of test 
problem #3 is just a special case of the Uniform Collapse test 
problem, and we found in Eq. (3.5) that the Q, energy errors 


went as Ae, < Ae, (Vv) = te,» Thus, for 6 = 3, the error 
using Q, (Vv) is 9 times as large as using Qe There is an even 
more disquieting error in using Q (Vv) , in that it preheats the 


gas ahead of the shock. This occurs because, in the preshocked 
region (Fig. 94), V = 1/p = (1 + 5, and thus V, < O, and 
g + O.: This preheating is, of course, not physical (and is 


another instance of a shockless Q Heating error - note that 
Qg, does vanish as it should), and this error combines with the 


the large Shockless Q Heating error near the origin to produce 
the poor results of curve (2) in Fig. 9. It's amazing that the 
shock solution is as good as it is. Just how slowly the Q; (v) 


solution converges is shown in the comparisons of Fig. 10, 
where the results are plotted for various mesh intervals: 
K = 50 (Ar = 0.02); K = 100 (Ar = 0.01), up to K = 800, where 
Ar = 0.00125. Even at K = 800, the numerical solution still 
has an unacceptable error, These results show that 9, (v) of 


Eq. (2.6) is a poor formulation and is essentially the reason 
that our definition of Q, given by Eq. (2.2) is taken to be 


the standard Q (for 6 = 1, 2, and 3). We stress this point 
since Q, v) still seems to be in common use. 
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Fig. 10 This example shows the truly large errors (in density) 
4 
using the original Q (Vv) = 2(Ar) 7 (r/R) (av/at) 2 for 


various mesh intervals (Ar). The comparisons are 
t = 0.6 and Ar = 0.01, 0.01, 0.005, and 0.00125, or 
K = 50, 100, 200, and 800. This shows that the 
convergence of the density to the correct value 


+ 
p = 64 is very slow indeed, and even for K = 800, the 


error is unacceptable. 


Now, of course, these are still serious errors in the use 
of the standard Q, = 2p (Au) 2. This difficulty is analyzed in 


Fig. ll. The problem is associated with the shock smearing due 
to Q. Because of the finite shock thickness, the calculation 
"senses" an incorrect (too small) jump-off value of the 


preshocked density (p). That is, the shock smearing selects a 


pes 's: BEREN = 16. This error is a maximum at early times and 


becomes less serious as time ađvances as the (similarity) 
solution spreads out the preshocked region over more and more 
mesh points. Thus, a given shock thickness produces less and 
less error as time increases. The key, then, to more accuracy 
is to sharpen shocks as much as possible. 
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(1) Solution independent of EOS | 
(1) ut =0, e* = 1/2, and in front of shock 


1 = plane 
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(3) Ẹ = 3 (shock front) 
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(11) Solution dependent on EOS P = (y - 1)pe = 2/3 pe for (y= 5/3) ~] 
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Pou. aih 
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64 sphere 5 = 3 


e- |- t=30 
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Fig. 11 The solution for Noh's spherical test problem (Fig. 84d) 
is given at two different times (t = 6 and t = 30) for 
the scale variable, — = t/R. As t increases, the 
preshock density profile is spread over a physically 
greater_and greater distance. Hence, the preshock 
value p = 16 should be progressively easier to 
resolve numerically as time advances. The wiggly line 
is the numerical solution using the standard 


2 
Q = 2p (Au)~. The numerical error is so large (20% 


á e < 600%) that it hardly resembles the exact solution, 
a 
p = 64. 
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refinement at the shock) 


PPM 
Shock-fitting 
(<Q = 0) method 
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“Fig. 12 The (non-Q) PPM method of Colella and Woodward (1982) 
has very narrow shocks (l or 2 mesh intervals), and, 
for the standard test problem, (K = 100) gives superior 
results. PPM using mesh refinement (i-e., a shock- 
capturing adaptive mesh with K = 400) is equivalent to 
the standard PPM using K = 1200. It is clear that 
using an adaptive mesh is a very important procedure 
for accurately tracking shocks. 


The non-Q PPM method of Colella and Woodward (1982) produces 
very sharp shocks (of the order of one-to-two mesh widths), and 
their results are considerably more accurate than the use of 
the standard Qs This is shown in Fig. 12. Also, the PPM 


results on the standard K = 100 problem are compared with their 
very accurate "adaptive mesh shock-following procedure" using 
K = 400. The K = 400 results are also shown to be nearly as 
accurate (converged) as the standard PPM with K = 1200. The 
effect of using an adaptive mesh is to minimize the actual 
(i.e., physical) shock thickness, and, as we have argued in 
Fig. 11, is all-important in determining the correct preshocked 
value p = 16. Clearly, using an adaptive mesh for resolving 
shocks is an important procedure, and such a method would be 
equally effective for any Q [or (Q&H)] shock-following 
procedure. 


6. SCALAR VERSUS TENSOR Q FORMULATIONS 


Schultz (1964) proposed that Q should be treated as a tensor 
viscosity and gave the following (T) formulation of the 
hydrodynamic equations (for ô = l, 2, and 3). We include the 
von Neumann-Richtmyer scalar (S) formulation, Eq. (2.1), again 
for comparison. Also, the use of the artificial heat flux (H) 
remains the same: 
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pu + PTR , 


iz A scalar (S) 
u a = 
ple, + PV) = -olup + (6-DEI - Say IR aly, 
R (6.1) 
pu, +P, = +o, + (6-1) 2] 5 
1 $-1 Tensor (T) 
Pee Fave! = “Qu gay FR BI, 
R (6.2) 


Schultz also defined a new Q, which we denote by 


Ory 
Q,,(S) = Cope |u. 


3/2) 11/2, (6.3) 
rr r 


iS Mai O, and O otherwise (and where % = Ar in the difference 


formulation as usual). 


Now, Schultz's Q, (S) doesn't produce a Shockless Q Heating 
error (since a = 0 for the Uniform Collapse Problem of Section 


3), and we thus might expect superior results for our spherical 
test problem #3. tJndeed, the results (Fig. 13) using Q, (S) (T) 


and Q, (T) [i.e., using Eg. (6.2)] are significantly better than 


using the scalar (S) equations, Eq. (6.1), but there is 
essentially no improvement using Q, (5) over Q We conclude, 


then, that the major improvement occurs because shocks are 
narrower (for any Q) in the tensor (T) formulation. The 
reasons why shocks are sharper is not entirely clear, but it 
follows, in part, from the formulation Eq. (6.2), where we note 
that there is less Qdv shock heating than for the scalar 
equations, Eq» (6.1). That is, Qdv > Qu, independently of 


geometry (6 = l, 2, or 3) in Eq. (6.2). The (Q, 8H, ) (T) method 
[i.e., Eq. (6+2)] with a small Q constant CÊ = 1/4 is 


particularly accurate. This is shown in Fig. 14, where we 
compare Q, (1/4) (T) (which indeed has a narrower shock, but is 


extremely noisy) with Lo, (1/4) sx, (10)] (T). These (Q, 8H, ) (T) 


results are reasonably smooth behind the shock and are 
essentially exact. Thus, we find the best all-round results 
for the 100-zone test problem are given by the (Q, 8H) (T) 


shock-following method using Schultz's tensor formulations, 
Eq. (6.2) or by using the (non-Q) PPM method with an adaptive 
shock~following mesh. 


50 


40 


Density 
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Schultz's tensor Q 
formulation (T) 
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The lower curve uses our standard Q, (2,0) = 20 (Au) 2, 


but is cast in Schultz's tensor formulation, Eq. (6.2). 
The upper curve also uses Schultz's tensor for 
simulation, Eq. (6.2), along with his Q (S) = 

2. (3/2), 1/2 f 
plau]? |u] of Eq. (6.3). We see that the results 
are essentially the same for either Q, formulation. 
We conclude, then, that it is the tensor use of Q that 


is important, rather than the particular choice of 
Qz" Consequently, we stay with the standard 


95 2p (4u)? usage. 
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Fig. 14 A comparison of (o (T) and the (Q, Hy) (T) method 


[i-e., Eq. (6.2)] using a very small Q, constant 
c? = 1/4. Here, the Q, (1/4) = 0.259(Au)? (T) 


formulation is very noisy, but produces a narrow 
shock. The sharp shock remains in the LQ (1/4 & 


H, (10)] (T) method [where H, (10) = lop|Au|Ae)], and 


we see that most of the post-shock noise is damped, 
and the density and energy errors are nearly zero. 
Thus, the (Q, aH.) (T) method is a preferred shock- 


following procedure. 


The results are summarized in Fig. 15, where we compare 
the various QSr (Q85) , and the PPM method. 
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Standard Q (S) 
Q= 2p (Au)? 


Density 


0 0.1 0.2 0.3 0.4 
Radius 

Fig. 15 We compare all of the methods for the standard K = 100 
test problem and for both the scalar (S) and tensor 
(T) formulations [Eqs. (6.1) and (6.2)]. The (non-Q) 
PPM Method lies between the scalar Q, (5) and (Q, 8H, ) 
(S) results, but is not as accurate as the Q; (T) and 
the (Q, 8H, ) (T) results. PPM with mesh refinement, 
curve (5), and the (Q, 8H.) (T) method, curve (6), give 


essentially the converged (exact) solution and are the 
preferred methods. 
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A SHOCK-FITTING SCHEME FOR THE EULER EQUATIONS 
USING DYNAMICALLY OVERLYING MESHES 


C.M. Albone 
(RAE, Farnborough) 


1. INTRODUCTION 


The problem of representing, accurately, embedded shock 
waves that occur in transonic flow with a subsonic free stream 
is considered. In most existing schemes (see Lytton, 1984, 
Jameson et al., 1981, and Hall, 1985) for solving the Euler 
equations, embedded shocks are captured as regions of high 
flow gradient spread over a few mesh intervals. Shock fitting, 
as in (Moretti, 1973), provides the mechanism by which shocks 
may be represented by surfaces of discontinuity; such a 
mechanism eliminates problems of shock resolution and removes 
the arbitrary degree of smoothing which is present in the 
currently popular non-upwind Euler algorithms such as those of 
Hall (1985) and Jameson et al. (1981). The problem is to 
devise a practical shock-fitting scheme, capable of dealing 
with flows in which little is known about the shock 
configuration. 

In the present method (described more fully in (Albone, 
1985)), shock fitting is approached using overlying meshes whose 
relative position-changes in the course of the calculation. 
For a single embedded shock wave, this involves a 
shock-orientated mesh that floats through the main mesh as the 
fitted shock finds its natural position within the field. 

This approach has the advantage of aligned shock fitting without 
placing extra constraints on the main mesh. On both meshes, 
an upwind, non-conservative, time-split algorithm is employed. 
Shocks are first captured on the main mesh, and then, after 
detection, are fitted as part of the calculation on the 
floating mesh. At each mesh point on the shock the 
Rankine-Hugoniot equations for a moving shock are applied 

and simultaneously the shock speed is evaluated. These 
speeds are then used to define a new position of the fitted 
shock, and the floating mesh moves with the shock to a 

new location within the main mesh. The use of an upwind 
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algorithm on the main mesh is not essential for the fitting 
scheme reported here, but it is preferred. Its use on the 
floating shock-orientated mesh considerably simplifies the 
logic of the fitting scheme. 


The method has been tested for the simple two-dimensional 
model problem of supercritical flow past a circular-arc 
aerofoil. The main mesh is rectangular and the surface 
boundary condition is formulated in the thin-aerofoil 
approximation. Results from capturing and fitting have been 
obtained over a range of supercritical Mach numbers. 


2. THE SHOCK-CAPTURING ALGORITHM 
The Euler equations are solved here in non-conservative 


form and are referred to rectangular Cartesian axes, (x, Zs. 
They are 


cles leg ðq 
Eye Blog aa son (2.1) 
where g = |A) ,B=juco Oy 7D=|woc o 
-c 
u u 
co yD Pae ETT 
w oou o c -a 
2" ¥ GHD) 
s ooo u ooo w 
2c 
and A E 


The velocity components in the x and z directions are u and 
w, respectively; c is the speed of sound, t is time and s is 


entropy E lIn (/%), where p is pressure and p is density. 


A first-order-accurate algorithm, for the solution of equation 
(2.1) is constructed from two one-dimensional upwind operators. 
It has been found convenient to use as dependent variables the 
interdependent set P, Q, E, F and S, where P =u +A, Q=u-A, 
E =w +A and F =w - A; their dependency is given by the 
relation P -Q =E - F, The set P, Q, E and F are often 
referred to as Riemann variables, since in one-dimensional 
homentropic flow each is a Riemann invariant. The resulting 
discretization scheme for equation (2.1) is 
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$ 2 

ô P = -(u+c)ô_ P - SRE wee c 

t (u+c) “i (w) Su 7 ô E + ) ar + TCT) 8S, 
(2.2) 

pa +C w-c aS 

6.9 = -(u- A pel 2) jg faa 

t2 (u c) 2 (w)ô u + í 7 ôE f > ô F + TC) x! 
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> o 2 

E=- 2 yd eae ES Pe 

t (w+c) ô E (u) S w [ 2 | OP + 7 ô 2 + -I 6s, 
(2.4) 

> H 2 

ô F = -(w- = eS pei Relat, ks 

t (w c) ô F (u) bw + 7 6 P ( 7 52 + AEN) ô 5r 
(2.5) 

ES 

6,8 = =- (u) 6s - (w) 65s , (2.6) 


a 
where 6 is a forward difference operator and 6 is a difference 


operator which may be forward or backward (8) . For all space 
differences in equations (2.2) to (2.6), except those for S in 
equations (2.2) to (2.5), 6 is interpreted as Se the bracketed 


coefficient (propagation speed) is positive and as 6 if the 
bracketed coefficient is negative. Where ô operates on S in 
equations (2.2) to (2.5), it has the same interpretation as 
for P in equation (2.2), Q in equation (2.3), E in equation 
(2.4) and F in equation (2.5). 


Since the scheme is explicit, the allowable time step is 
governed by the CFL condition. The propagation speeds, 
u u +c, u- Cc, W, w +c and w - c are evaluated as an 
upwind average. Given a uniform distribution of entropy far 
upstream, the solution obtained from equations (2.2) to (2.6) 
(in the absence of shock fitting) can result only in homentropic 
flow. 


Solid surface boundary conditions in the present formulation 
are linearised. Flow tangency for the circular-are aerofoil is 
therefore satisfied on the chord line z = o. Flow variables on 
a line of dummy points below z = o are set by symmetrical 
reflection of A, S and the tangential velocity component, and 
by antisymmetrical reflection of the normal velocity component. 
Far-field boundary conditions are those corresponding to a 
uniform stream. Where the boundary is homentropic, there is no 
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need to test for inflow or outflow or to test for subsonic or 
supersonic flow; the upwind algorithm will select precisely 
what it wants from the complete set. For non-homentropic 
boundaries (subsonic outflow in the presence of shock fitting), 
boundary conditions for Q and S, required by the difference 
scheme, are necessarily evolutionary. That for S is obtained 
by solving equation (2.6) on the downstream boundary. That for 
Q is obtained by solving equation (2.2) for P on the downstream 
boundary and by using the specified downstream pressure. 


3. SHOCK FITTING 


3.1 Detection and sharpening 


The shock-fitting procedure is intended to correct the 
errors of non-conservative capturing by representing the shock 
as a discontinuity across which the Rankine-Hugoniot equations 
are imposed. The first step is to detect any captured shock 
waves and determine their location and shape. For the present 
simple model problem, captured shocks are nearly normal to the 
x direction, and so detection can be based upon the appropriate 
sign change of u ~ c. 


At this stage, the distribution of pressure etc, close to 
the shock is that for a smeared, captured shock. It is 
desirable to sharpen this distribution before beginning the 
fitting process. In order to carry this out, dependent 
variables adjacent to the shock location are adjusted by using 
linear extrapolation from neighbouring points on the same side 
of the shock wave. This adjustment is never repeated once 
the fitting and moving schemes are operating. 


3.2 Interaction between fixed and floating meshes 


The two interacting meshes are shown in Fig. 1. Points on 
the fixed mesh are indicated by either squares or triangles, 
and those on the floating mesh by circles or diamonds. Squares 
denote mesh points at which the Euler algorithm can be applied 
without any risk of differencing across the shock wave. 
Triangles denote those points where there is a risk (not a 
certainty) of differencing across the shock. Boundary values 
at the diamond points, required for the calculation on the 
floating mesh, are obtained using bilinear interpolation 
from values on the fixed mesh. Plow variables are then updated 
at circle points on the floating mesh, and simultaneously shock 
speed is evaluated at each mesh point on the shock wave as 
described in section 3.3. Next, flow variables at the square 
points on the fixed mesh and at the fixed mesh boundaries are 
updated. Finally, interpolation is again used, but now to 
update flow variables at triangle points on the fixed mesh from 
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the updated circle point values on the floating mesh. It 
should be noted that interpolation involves mesh points all of 
which lie on only one side of the shock, and so interpolation 
across the shock cannot take place. Further, the floating 
mesh has been made large enough so that all interpolated 
values are obtained wholly from points that are updated using 
the Euler algorithm. The calculated shock speed is used to 
change the location and shape of the shock and so move and 
distort the floating mesh as described in section 3.4. 


Fig. 1. Shock-orientated mesh overlying the fixed mesh. 
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‘The floating mesh is defined with the coordinate direction 
normal to the shock as X and that along the shock as Z, the 
shock itself being X = O. Mesh intervals in the Z direction 
are formed by the intercepts of the shock with lines z = 
constant; those in the X direction are chosen just large enough 
for variables at diamond points in Fig. 1, when obtained 
by interpolation from points on the fixed mesh, to involve only 
those points marked as squares. On z = 0, the shock is normal 
to the body surface tangent. The mesh is extended in the -Z 
direction to form a set of dummy points, which comprise its 
lower boundary and in the +Z direction beyond the last point at 
which fitting takes place to form its upper boundary. On 
every line Z = constant there are six points numbered 1 to 6 
as shown in Pig, 1. Points 3 and 4 are coincident, but have 
distinct values of flow variables; point 3 lies on the upstream 
side of the shock and point 4 on the downstream side. Points 
1 and 6 constitute the upstream and downstream boundaries 
respectively of the mesh, 


3.3 Calculation on the floating mesh 


We use as dependent variables the set equivalent to those 
used on the fixed mesh, but with velocity components defined 
in the X and Z directions and denoted by U and W respectively. 
The set is Pe Qer Bor Fy and § defined by 


Pee Uta, Qe = U-A, E,=W+A and F.=W-A, 


f f 


where the suffix f denotes floating mesh, Initial values are 
set at all points on the floating mesh, immediately following 


— N iai Ana m by using bilinear interpolation 


Updating of flow variables takes place at all circle 
points using the Euler algorithm of section 2 recast in the 


curvilinear system (X, Z). The dif 
pete enc , 3 ference equations take 


r» 
ôP, =- ~ ww =» |e l [We 
Pe: THMUtC) Ske 40 R ea See +R es) SFe 


SHOCK FITTING WITH DYNAMICALLY OVERLYING MESHES 433 


+ 
= -(U- - i [We _ i [Wee 
6 2p = (0-0) 6,08 a R UR ( 2 } Se- R í 2 ) SFs 
i K 2 
a a a (3.3.2) 
‘4 l Ute U=: 
PEE S y: ài C 
$5. <> 5 (We) ô Eg (U) 5 W fe } oP. + 3) 8 Qe 
a KU 
+ RyG E (Wte) , (3.3.3) 
p l Utc U-c 
oe R M ea SPs =) 8s 
Pa KU 
——T - => (We 3.4 
+ eee (3.3.4) 
$ w 
ers esr 3.3.5 
6,8 (u) 5S pet ( ) 


where R = 1 + KX, and K is the shock curvature. The 
interpretation of the difference operator ô is the same as that 
for the fixed mesh algorithm described in section 2. Updating 
at points 2, 3 and 5 (Fig. 1) is performed using equations 
(3.3.1) to (3.3.5). At point 4, only equation (3.3.2) is 
solved; the remaining equations are replaced by the 
Rankine-Hugoniot equations for a moving shock, with upstream 
conditions taken as those at point 3. In this way due account 
is taken of the manner in which upstream propagation on 

the (U - c) characteristic affects point 4, and the number 

of equations enables us to evaluate the shock speed as well as 
the flow variables at point 4. ‘The shock speed, V, is given by 
the implicit equation 


l 
2 2 
2c, f1- 2c, 2(y-1) | 3-1] 
+ 2 1+ = Qla 


3° YDM YD (y#1)? M 


(3.3.6) 


434 ALBONE 


where Ql, is the value of Qe at point 4 obtained from equation 


(3.3.2), the suffix 3 denotes evaluation at mesh point 3 and 
M, = (U3 - V) lcz. The solution for V is obtained iteratively 


and is then used (together with flow conditions at point 3) to 
evaluate Per Egr Fe and S at point 4 from the remaining 


Rankine-Hugoniot equations. 


A study of the terms in equations (3.3.1) to (3.3.5) 
reveals the advantage of combining an upwind scheme with a 
shock-orientated mesh. No differencing can take place across 
a shock as long as U > c at point 3 and U < c at point 4. 

For instance, we see that at point 3, U, U + c and U - c are 
positive and so only backward differences are employed. 
Similarly at point 4 U - c is negative in equation (3.3.2) and 
so differences of Qe and S in that equation are forward. 


Boundary conditions for the floating mesh calculation are 
supplied at points 1 and 6 on each line Z = constant, and at 
all points on the top ard bottom boundaries. As with 
far-field boundary conditions on the fixed mesh, the algorithm 
is capable of sorting out what information it requires from 
the complete set available at the boundaries. 


3.4 Determination of a new shock shape, position and extent 


It appears at first sight that this should be 
straightforward. Each mesh point (2) on the shock should 


be moved a distance Atv(Z,) at each time step in a direction 


normal to the shock wave at that point (where At is the local 
time increment and v(Z,) is the shock speed) to define a new 


shock shape and position. A problem with this approach is 
that unsmooth shock shapes can be generated, and this may 

lead to numerical instability as shown by (Moretti, 1974) for 
bow shocks. Here, we have a further difficulty in that shocks 
terminate in the field with vanishingly small strength. The 
scheme adopted for an embedded shock with a subsonic free stream 
involves representing its shape by a constrained polynomial. 
This gives the shock an effective stiffness and so constrains 
the location of its weak free end. Although constraining the 
natural shape of a shock is in principle undesirable, its 
effect upon shock shape is very small for embedded shocks, 
which usually have low curvature. The unconstrained new shape 
is evaluated straightforwardly and then the constrained shape 
is obtained here by approximating the unconstrained shape by 
a single, weighted, least-squares-fit cubic which is normal to 
the aerofoil surface tangent at z = 0. The weighting is 
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applied so as to bias the best fit towards the stronger (and 
hence more entropy producing) end of the shock wave. 


Finally, we consider the question of how to allow the 
extent of penetration of fitted shock into the field to 
change.. It has only been found necessary to allow for shock 
shrinkage because captured shocks (and thus initially fitted 
shocks) are stronger than a Rankine-Hugoniot shock should be. 
As the solution converges the shock moves upstream, weakens 
and its field penetration is reduced. The need to reduce the 
extent of the shock is detected by evaluating the upstream 
normal Mach number at the shock point furthest into the field 
each time the variables on the floating mesh are updated. If 
this is less than 1.01, the shock is considered to have so 
little strength there that fitting is unnecessary at that 
point and so the field penetration is reduced by one mesh 
point and the floating mesh shrinks accordingly. 


4, RESULTS FOR A CIRCULAR-ARC AEROFOIL 


The choice of test problem was dictated by the need to 
provide a simple geometric environment in which to test the 
overlying mesh shock-fitting scheme. Calculations were 
performed on a 10% thick circular-arc aerofoil at zero lift. 
The rectangular computing region, -1.5 < x < 2.5 and 
O < Z $ 1.0 was covered by a uniform mesh, with the 
aerofoil chord line lying on z = O between x = O and x = 1.0. 
Results were obtained with a 128 x 32 mesh which gave a 
uniform 3.125% chord spacing everywhere. Calculations showed 
that the aerofoil has a critical Mach number of about 0.79. 
A series of calculations at Mach numbers (M,) from 0.80 to 
0.90 in steps of 0.02 was performed. The results showing 
captured and fitted shock waves at M, = 0.82 and 0.90 are 
contained in Figures 2 and 3. These were considered to be 
converged when the average normalised residual fell to 


a7 
5 x10. Using a three-level multigrid convergence 
acceleration scheme, this was achieved in 5 cpu seconds for 
Mo = 0.82 and in 8 cpu seconds for M» = 0,90 on a Cray ls. 
The two examples illustrate that the method works for weak 
and strong shocks, In the steady state, entropy jumps 
discontinuously from a value of zero ahead of the shock to 
its Rankine-Hugoniot value and thereafter it remains 
constant on streamlines, 
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5. CONCLUDING REMARKS 


A new shock-fitting scheme for the Euler equations is 
presented which involves overlying meshes, one of which is 
shock-orientated. The scheme is embedded in an explicit 
non-conservative upwind shock-capturing algorithm, which in 
turn has been linked with a multigrid acceleration technique, 
resulting in a rapidly converging method. The use of an 
overlying, shock-orientated mesh simplifies the logic of shock 
fitting and avoids overconstraining the main mesh at the cost 
of a small amount of interpolation. Further, shock-fitting 
appears not to conflict with multigrid. The scheme is robust, 
and there is no apparent reason why the analysis should not 
be extended to more general two-dimensional problems or to 
three dimensions. 
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LAGRANGE- GALERKIN MIXED FINITE ELEMENT APPROXIMATION 
OF THE NAVIER-STOKES EQUATIONS 


Endre E. Süli 
(Department of Mathematics, University of Belgrade, Yugoslavia)* 


l. INTRODUCTION 


Recently a new method has been proposed for the numerical 
treatment of convection-diffusion equations (Benque et al., 
1980; Russell, 1980). It is based on combining the method of 
characteristics with finite element or finite difference 
procedures. For convection dominated problems it has smaller 
time truncation errors than those of standard methods, as shown 
by Douglas and Russell (1982). 


Pironneau (1982) gave a detailed analysis of this method 
for the Navier-Stokes equations. He proved an error bound 


h+ At + BAE, which is suboptimal in space, but he allowed the 
kinematic viscosity to go down to zero. 


In this paper we present optimal error estimates for the 
method when the nonlinear convection term and the viscous term 
are roughly equivalent in importance. Using the Lagrangian 
representation of the flow, we discretize the total derivative 
of the velocity along the trajectories and we combine this 
special discretization with the mixed finite element method. 


2, FORMULATION OF THE PROBLEM 
2.1 The differential system 


We consider the nonstationary Navier-Stokes problem: 


u + (u.V)u - vAu + Vp = f 
V.u =0 in 2 x (0,7, (2.1.1) 
ulg =o ule = Yor 


*Now at the Oxford University Computing Laboratory. 
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where N is a bounded domain in r, T is a fixed positive 


number, u is the velocity field in r, p is the kinematic 
pressure, v the kinematic viscosity, f a density of body forces 
per unit mass and a, the initial velocity field. For 


simplicity, we assume homogeneous boundary data and the domain 
Q to be convex polygonal. 

For 1 < p <, let bP (9) denote the usual Lebesgue space 
with the norm ||.| | 1P. For m>0, w'P(9) will denote the 
Sobolev space of index m with the aie ||. {| wore (Adams, 1975) . 
For p=2 we write Bo (Q) for wu 209). a} (9) will denote the 


closure of the set of infinitely Al ffeventiable functions with 
compact support in N for the norm ||. | lat- Let X be any of the 


spaces just defined. Then x will denote the topological 
product X x X. 


We need the following special spaces, well suited to 
incompressible flow: 


H= {ue 1-(m?: Vows o, uno =o}, 
ve=f{ue am’: V.u = 0}, 


Leia) = { pEr : fe ax = or. 


For l <p<%, LP (O,T;x) will denote the set of strongly 
measurable functions u: (0,T) > X such that 


T 
Ilall P; =f Ilu] | P at )2/P <», L<p <>, 
L*(0,T;X) o x 
or 
Ilall A = ess sup ||u(t) || <o, p=, 
L (0,T;X) te (0,T) x 


Finally, C(0,T;X) will denote the set of continuous 
functions u: [0,T] + x normed by 


|lu] = mx |lu(t)|| . 
C(O,T;X) ~— te[O, 7] x 
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The mixed finite element approximation of problem (2.1.1) is 
based on the following saddle-point weak formulation. 


For f and uy given, 
ES L (0,7;17(2)7), u EV 
; 2 pee) 2 
find (u,p) such that (up) EL (0,T;H, (2) x L (9), 


ai (av) + (Cu. u,v) + v(Vu, Vv) = (Vevep) = (£,v) Vv æm’, 


(2.1.2) 
2 
(V.u,q) =O VqE L (%), 
u(O) = use 
2 2 
where the inner products are to be interpreted to be in L (2)". 


It is well known that problem (2.1.2) has a unique solution 
2 Pilea) 1 
(Temam, 1983). Moreover, (u,p) € L (0,T;H (2) x H (2)), 


u E C(0,T;V), uv E L (O,T;H). Hence the material derivative of 
u 
Diu =u’ + (u.V)u 


belongs to L7(0,T;12(9)2) and (2.1.2) may be rewritten as 


Wie) 
(Duv) + v(Vu, Vv) - (Vev;p) = (f,v) VvE H% 4 
Vu =0 Vaer, (2.1.3) 
u(0) = us 


The crucial aspect of the method is the approximation of the 
material derivative. 


2.2 Lagrangian representation of the flow 


The Lagrangian representation of the flow is based on the 
function 


Xt Gusit) € 2x (0,7)? > X(x,sit) € ®, 
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where X(x,s;t) represents the position at time t of the 
particle of fluid which is at point x at time t=s. This means 
that t 7X(x,s;t) is the parametric representation of the 
trajectory of this particle. If we are given the velocity 
field u, then the trajectories may be determined from the 
initial value problem: 


a 
s X(x,s;t) = u(X(x,s;t) ,t) 
a (2.2.1) 


X(x,s;s) = x. 
17?) 2 i 
If u belongs to L#(0,T;W (Q)"), then X(.,s;t) isa 
Lipschitz continuous bijection of Q onto Q and its Jacobian is 


equal to one almost everywhere on Q x (0,7). 


Let M denote a positive integer, At = T/M and tle m.At. For 
XE M seta tt Stas tatl’ O < m < M-l, (2.2.1) implies that 


is 
m+l 


Kim X(x t rt) = fp w(X(x,t Ft) pt) at 
m a22) 


= Ateu (X(t L meL) ty) = At.u(x,t 


mt! $ 


Let y = (1 + [ul2)2/2 and t = (1,u,,u,)/p. Then, 


du(x,t 


) 
m+l 9ot(x,t 


Dult a) = v(t.) a } 
m+l 


The backward difference approximation 


u(x, ta? y u(x tl) - U(X (xt rt) (Aa) 
FE mae 2,1/2 
at(x, ti) {|x X(x tli En l + eezal j 


combined with (2.2.2) gives: 


u(x,t 1) - u(Xx(x,t at), t) 
z m+l mtl m’ m 
D u(x, t a1? PA S a E 


At Fe 


Now we turn to the problem of spatial discretization of (2.1.3). 
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2.3 Formulation of the finite element procedure 


Let h denote a discretization parameter tending to zero hee 


let Xa and Ma be two finite element all such that Xa c ui (8) 


N 


and Ma CL (Q). We define W ES and Va = {vE Wis 
ov ap E mM} z 


no 


(Vev, d,) 


Assume that Wa (resp. Ma) is associated with a uniformly 
regular polygonalization Ch of 2 and piecewise polynomial 
functions of some fixed degree % (resp. &%-1l), & > 1. 


We relate the continuous and discrete spaces by the following 
hypotheses: 


Hl) There exists a positive constant C such that 


ing [lvl] 4 < eh” lvli 
iii! r+1 
vou H H 


vyen (ag? nut gi? Ver srs. 
H2) There exists a positive constant cy such that 


inf || a, Il 2 < ch Ila ll a 
D. 


WEM 
T 2 
vaen (a) NL% Vr: O<r<k. 


H3) (The BabuSka-Brezzi condition) There exists a positive 
constant C3 such that 

(Vev, 7q, ) 

hh 5 es 


inf sup a 
in © Ma TSM lvl a llall 2 


The aa a of the velocity and the pressure at ct will 


+: + 
be denoted by a and Ph Now, ae 5 Ph F (3 Wa x Ma is ae 


to be the solution of the discrete daphne: 


444 SÜLI 


1 
(aur, v) veva, ww.) - (ev eee y= (ey) YVE 


h h’ 
(2.3.1) 
Wut, gq) =o Wem, 
where 
m+1 m $ 
U at 


and etna) is the solution of the initial value problem: 


a m,m 
aS Xi (ert yy it) Fu tit)), t <t<tiy, 
(2.3.2) 
m 
XO tar Emen) =x. 
The initial approximate velocity u (3 Va is determined as 
the a - elliptic projection of us onto Vh 3 
o 
(Vu - ude Ww) =0 vv €E Vi ` 


The theory of mixed finite element methods shows that for 
u, u and p E smooth, the error uu, is optimal in 


space (order Ne in H 1) and time (first order). To be more 
specific, for a normed space X with the norm || | ly let 
Ploma = { v: {t,,...,t, x= [lvl] = 
cg WO S 2? (0,7;x) 
es J 
= (at Z |] vep] PP <}, 1< p< o, 
isl Sh ex 
% 
& (0,T;X) = { v: {tress rtu >X : MA = 
= mee [Ivey ile}, pas. 


l<ic<m 
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Assume that 


EE C(O; L (M), 
(2.3.3) 
stl 


2 
n 
uy EE (® W 


Hea? nv, 


where s > 1. Let us organize our regularity hypotheses on the 
corresponding solution (u,p) of (2.1.2) according to the 
results in which they are used. We assume 


2109)? av) 


u € L2(0,T; a (q)7 N w 
ue L (0,7;85 (0) N #) 
ute L (O,T;H) (2.3.4) 
p E L(0,7;85(9) N L) 


pE L (oTa) 


ele 1 


u E (onn (9)? 9 wht? (a)? 9 v) 


16g)? 1 wy 


we 12(0,7;H°* 

u"e L? (0,7;H) i (2:3:5) 
o s 2 

p E€ L” (O,T;H (2) ia) 15 (9) 


pE L (O, T:E (M). 


Theorem 1 (süli, 1985) 


Suppose that the hypotheses Hl, H2, H3, (2.3.3) anā (2.3.4) 
hold and assume that the discretization parameters obey the 
relation 


At = olh). (2.3.6) 
Then 
r 
| [u-u l| + ||p-p, || < C (h? + At) 
“h Pomno’) a lomim T 
Vr: 1< r < min(2,s), 
| |u-u,| | < c, h” + dt) Yr:1<r<min (2,5) , 


aort (9)7) T 
o 


446 SÜLI 


for h sufficiently small. The sizes of the At terms depend 


2 
principally on | [peu | 2 + where Du = D, (Du) . 


2 
L OTL (0) 
Remark 1 


For reasons of stability we require that the time step 
vanishes faster than the spatial mesh (At = o(h)). Since 
olh”) = o(h) for r > 1, this constraint is important only in 
the case min (%,s) = 1. 

Theorem 2 (Süli, 1985) 


Suppose that the hypotheses Hl, H2, H3, (2.3.3) and (2.3.5) 
hold and assume that the discretization parameters obey the 
relation (2.3.6). Then, 


| lu-u, | | TAIS came + At) Wr: 1 < x < min (£,s), 
2° (O,T;L (9) ^) 


for h sufficiently small. The size of the At term depends 
principally on | |peu| | 


L? (0,7; (9) 7) 


Remark 2 


+l 
Since olh” ) = o(h) for r 2 l, the mesh restriction (2.3.6) 
is insignificant asymptotically. 


Example 


In the Hood-Taylor mixed finite element method the elements 
KE Ch are quadrilaterals and the approximating spaces are 


defined as 
CIENTS (OW rie: <2 (a ae a ee CG } 
h o Hip IE SA A E 27 NKE CF 
2 
o 


= N cñ) : A 
M {q e L (2) N c(Q) : aS Q (K); YKE Cc}. 


Theorems 1 and 2 for h sufficiently small and % = 2 give the 
following error estimates: 


lluy ll 1 + ||p-p, | | < C,(h? + At), 
h'l 2 2 4 


L (0, 7H, (9)7) & (0,T;L (2)) 
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3 
[lu - all < C (h + At). 
h gecom (992) E 


For m=0,...,M-l, the discretization procedure (2.3.1) leads 
to algebraic systems of the form: 


a B? yett ag out 
BO put o 
with a symmetric matrix - the same matrix: on each time level. 


In practice, the vector gut is calculated by numerical 
quadrature rules (e.g. Gaussian rules). We fix the quadrature 
points ie k=1,...,K at time level m+l and we ask where they 


came from at level m. In other words, (2.3.2) has to be solved 
only for xX k=1,...,K. Since the spatial mesh is fixed for 


all time the implementation of the method is simple. 
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OPTIMAL ERROR ESTIMATION FOR PETROV-GALERKIN 
APPROXIMATIONS IN TWO DIMENSIONS 


B.W. Scotney 
(Department of Mathematics, University of Ulster) 


l. INTRODUCTION 


We examine the optimality of Petrov-Galerkin approximations 
for the linear diffusion-convection equation in two dimensions. 
The Riesz Representation Theorem provides a relationship 


between the trial space s% - test space T” pairing and the 
optimality of the associated Petrov-Galerkin method. We 
present an analysis which leads to optimal error estimates for 


the Petrov-Galerkin approximation U © sh of the form 


[u-u]; <c inf |u-v| involving the smallest possible constant 
VES 
C, where | . lii denotes the norm fix 240)". 


The diffusion-convection problem takes the form 


Lavu + V. (bu) = f ing, 
(1.1) 


u=gon 92 du/dn = ð on an, 


1”! 
for a quantity u. Here, a is a constant scalar diffusion 
coefficient and b = b(x), xE, is a vector convective velocity. 


Dos 
We suppose that 2 is a bounded open region in R with a 
polygonal boundary 92 = an, Van, and 92,902, = >; 9du/dn 
denotes differentiation in the direction of the outward 
normal on dy- 


We adopt the notation 


WYP = {w:p°w € LP y a such that Jal <m}, (1.2) 
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2 
and we denote by u? the space we - see Oden and Reddy (1976). 
i 1 2 
There exists a continuous linear mapping yass 1 AT ap (321) ’ 


Ay Si 
and we suppose that g is such that there exists G € H with 
¥, (2) = g. We define the space of functions 


1 1 “i 
i ={veuH lyy = 0Oon aaj} 5 (1.3) 


We may then define 


ia 1 1 
Heo iv EH bape Ba) > (1.4) 


We shall also use the space HR. As general references, see 
Ciarlet (1978), Girault and Raviart (1979). 
The weak formulation of problem (1.1) is to find u € m such 
that 
B(u,v) = <f, v> Yve ni K (1.5) 
o 
We make the following assumptions : 


2 2 
Een, O giar obeng] y inbi? Oon Ny Wb=0- 


1 1 
For v € H_ we may write v = F 
z y te v rA G, where ve ‘= Ha We may 
thus reformulate (1.5) as: find u E a such that 
o 


B(u_,w) = F(w) Ywen, (1.7) 
o E 
o 

where the bilinear form B(.,.) : HR x a > R is given by 
o 


A J 1 
B(w, W3) = <aYw -Yw,> + <V. (bw,) »w, > Yw E€ H /R, w E Hoos 


2 2 
o 
(1.8) 
and the linear functional F(.) : ul > R is given by 
BS 
F(w) = <f- V. (bG),w> - <aVG,Vw> Wwe oa ` (1.9) 


o 
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We may establish the existence and uniqueness of weak 


i ak 
solutions v E Ho to (1.7) and u € m to (1.5) via the 
o 
Lax-Milgram Theorem -see Ciarlet (1978). 


We shall assume that the trial space sh Cc a, and that it is 
of dimension N, spanned by a basis KAD i= 1,...,N}. The 


space st is defined as 


h h l 
= n 
S5 S Ha . (1.10) 


o 
We will consider only g for which there exists oh € s” such 
that Y = g. We may then define 


sh=(v-c'+w|westh. (1.12) 


The Galerkin approximation to (1.5) is to find U E sp such 


that 
h 


B(U,V) = <£,V> VvE S5 . (1:12). 


Since s? c n the existence and uniqueness of U €E sr are 


o 
guaranteed by the Lax-Milgram Theorem, and the following error 
estimate due to Babuška and Aziz (1972) holds: 


[u-u] <I + c/c] ing, luvly A (1.13) 
VES, 
E 
Here, | . li denotes the norm <T., T> where T is the operator 


* * * 
T. = a V. derived from T T = 4(L +L) where L is the formal 
2 
adjoint of the differential operator L. = -aV . + V.(b.). If 
the trial space is characterised by the estimate 


k-1 
Jeu, LEEI (1.14) 


where UL (x) € sh is the finite element interpolate of u(x), and 


h represents an element size, (see Strang and Fix (1973)), the 
ratio c,/c, in (1.13) may be replaced by a mesh Péclet number 


|p| h/a. The Galerkin method may therefore become "near- 
optimal", but if the Péclet number is large, this will occur only 
for extremely small values of h. This is precisely the 
situation when the flow is convection-dominated. 
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Many authors have proposed Petrov-Galerkin methods for 
ps 
problem (1.1). A test space T” (a xt other than S is employed. 
Setting m = T N ca , the Galerkin system (1.12) is replaced 
o 


by the problem of finding U € a such that 


h 
B(U,V) = <£,V> VV ET). (1.15) 


q” has the same dimension as Be and is spanned by a basis 


{y, i= 1,...,N}. The problem of choosing 5° to adequately 
represent the solution to (1.5) remains, but the principal 
problem now is the selection of the test space for a given 
trial space. 


2. OPTIMAL ERROR ESTIMATION 


It is convenient to introduce the operators 


k 
T zay, T= Vee pyar (2.1) 
We may then write the Petrov-Galerkin formulation (1.15) as: 
find U € s such that 
<T U,T V> +f b.nUvas = F(V) yver. (2.2) 
DTA Esta A o 
2 


Let Hy denote the Hilbert space m equipped with the inner 


o 
Product 


VWa = <a"vy,a“yw> = <T WTW x (2.3) 


We note that <er? provides a norm for a/R, and we denote 


T 
by H the Hilbert space H/R equipped with the inner product 
(2,3). Then using the Riesz Representation Theorem, there 
exists a map R} ? A) x B such that 


T y Š 
<T w T w> ‘etl benw,w,4s = <T w, T] (R,w,)> Vw, © H, w, € H}. 


2 (2.4) 

Hence we may consider generating an approximation U & s? to 
the problem defined py (1.7) s (1-8) and (1.9) using a test 
Space which is related to the trial space through the 
transformation 
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h h 
RT, 85° (2.5) 


The existence of an inverse RE 3 H sj H is guaranteed - 
see Oden and Reddy (1976). 


Using (2.4), the Petrov-Galerkin formualtion (1.15) may be 
written as : find UE st such that 


9 h 
<T U,T, (RV) > m PIN V E (2.6) 


Hence if the choice of test space in (2.5) could be achieved 
exactly, the approximation obtained from (2.6) is optimal in 
the norm lav. | defined by Ti 


Petrov-Galerkin formulation fails to satisfy (2.5) may be used 
to identify the extent to which optimality of the associated 
approximation is lost: 


. The extent to which a given 


Theorem 2.1 (Morton (1982)). 


For each v € H let R : H > Hy) be such that 


B(v,w) = <V RW Vwe ne (2.7) 


Then if the constant 4) is such that 


h 
inf, |v-r w|, <4 |v], yves (2.8) 
wer? TEU S yl o 


and U is the Petrov-Galerkin solution to problem (1.15), and u 
is the solution to (1.5), the following estimate holds: 


4 


uu], < (1-A) ang, uov], . (2.9) 
l1- ai h ak 
VES, 


Proof 


See Morton(1982). (For a proof of the Riesz Representation 
Theorem, see Adams (1975).) 

The smallest constant aan in the approximation 
property (2.8) measures the degree of success in approximating 
the trial space by the test space transformed using the Riesz 
representation operator Ry: It is this smallest constant which 
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controls the optimality of the associated Petrov-Galerkin 
solution. 

We present a general technique for evaluating Ay as 


V-R W| 

A, = sup, inf l at 

l ves? wer” v 9 Res 10) 
o o ab 


h 
Any element V € S5 may be written as 


N 
v= = Vọ, (2.11) 
AEn ie 
and any element W E€ ” as 
N 
w= = Wy . (2.12) 
i=1 thes i 


We denote by V the N-vector with components Vy! j =1,...,N, 


etc. By introducing the three N x N matrices A, B and C whose 
(i,j) entries are 


Pay See aL 


ee wh gma 2 
Cry = <b 1057) , 
respectively, we may obtain 
T 
a = sup, (1+ Zarar RTE (2.14) 
eR xx 


where Q = a *BC™?. 


2 
The constant a may thus be obtained by computing the largest 


eigenvalue of the matrix 1-079. Alternatively we may compute 
the smallest eigenvalue A satisfying the generalised 


eigenvalue problem 
T -1 
BA BV = à, CV ’ (2.15) 


(see, for example, Wilkinson (1965)), and thence obtain the 
2 = 
error constant (1-47) * via the relationship 1-44 = Ay: 
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For the general diffusion-convection problem in one 
dimension, it is possible to construct explicit analytic 
expressions for the Riesz representation operator RI and for 

3 =a r 
its inverse R - The matrices, A, B and C in (2.13) may thus 
be constructed, allowing an analytic stuđy of the generalised 
eigenvalue problem (2.15). This is performed by Scotney 
(1985), resulting in optimal error estimates of the form (2.9) 
in which the optimal constant aan” is evaluated in terms 
of the mesh Peclet number and simple functions of the test 
space. The analysis is used to generate "near-optimal" test 
spaces by the selection of upwinding parameters which minimise 
the constant in the optimal error estimate. Griffiths and 
Lorenz (1978) have also investigated the optimality of Petrov- 
Galerkin methods, though their error estimates are based on the 
non-optimal bound (1.13). 


3. DECOMPOSITION IN TWO DIMENSIONS 


In two dimensions the Riesz representation operators 
relating the optimal test space to the trial space are no 
longer explicitly available. We consider a decomposition of 


[17]? into the direct sum of curl-free and divergence-free 
subspaces. This enables us to perform an analysis in which 
the Riesz representation operator is not explicitly required, 
at the expense of approximating a family of self-adjoint 
boundary value problems, each associated with an element in a 
basis for the test space. 


We remove the dependence on the explicit form for the Riesz 


representer R from the matrices in (2.13). The matrix C does 


not involve Ry and we may use (2.4) to immediately write 


Bij = <R et (3.0) 


The removal of Ry form the matrix A is described below. 


We restrict consideration to the diffusion-convection 
problem (1.1) with Dirichlet boundary conditions. We shall 
further assume that 2 is a bounded simply connected Lipschitz 
domain, (see Oden and Reddy (1976)). The map Ry H > Hy is 


defined by (2.4) with Ti and T3 given by (2.1) so that, using 


Green's Theorem and V.b = 0, 


<aVv, Vw + (b/a)w - V(R Ww) > =O VvEH, WE H a F2) 
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Equation (3.2) is used to write the (i,j) entry in A in the 
form 


Agy = RP Riva m ALR 5) LRH)? 


= <a (RY 5) DY, + (b/a)Ņ,;>. (3.3) 


1,2 
Since (b/a) € Wate (b/a)y, € [a , and we consider the 
following decomposition theorem: 


Theorem 3.1 


Let RC R? be a bounded simply connected Lipschitz domain. 
1 
Then since (b/a) 4; (3 [x7 there exists a function a,€ H and 
a function zE u? such that 


(b/a) y; = Ya, + Vxz (3.4) 


nie 
aq, /ən = (b/a) p,m =O on oN. 
Proof 


See Girault and Raviart (1979), pp. 29-30. Moreover, a; G H? 
is determined uniquely up to an additive constant, giving a 


unique 4i E€ H. Further, Zi € x may be determined uniquely as 
the solution of 


<Yxz, ,Vaw> = <(b/a)y,Vaw> Vwen, (3.5) 


Zs = 0 on 32 


- see Girault and Raviart (1979). We note that in two 


dimensions the vector curl, Vxa, denotes Vxa = (9a/3y,-3a/3x) -, 
whilst the scalar curl, Vxa, denotes Vxa = da, /dx - da, /əy; 


T 
where a = (ajay) - Using Green's Theorem, Zi E H5 is the 
unique solution of 


<Ve,,Uw> = <Vx( (b/a)p w> Vwen, (3.6) 


rad O on dN. 
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The solution z, E€ n N A (Grisvard (1976)). 


i 


We now use (3.4) to rewrite equation (3.3) as 


A. = . . 
ij AAIR TWI DALS + q,) F Vxz5)> (3. 7) 
By use of (3.2) and (3.4) again, we may remove 4 from (3.7) to 
obtain 

ai = sawi + E ETAL N + (b/a) 4> - PAE NAE E R: 


This is the representation introduced in Morton and Scotney 


(1985). The operator R has thus been removed from the 


matrix A at the expense of introducing the functions 2i and Fie 
4. APPLICATION TO ERROR ESTIMATION 


We apply the analysis introduced in Section 2 and Section 3 
to evaluate the optimal constant CE in the error 


estimate (2.9) for the "upwind" Petrov-Galerkin approximation 
of Heinrich, Huyakorn, Mitchell and Zienkiewicz (1977) anā for 
the Galerkin approximation to the diffusion-convection 

problem (1.1) in two dimensions. For ease of presentation we 
consider a rectangular domain 2 = (x, 7%,) x (yor¥p) discretised 


using a uniform grid characterised by a parameter h with nodes 
at positions TY) = (x + khyy, + lh), k=0,..-/N; 1=0,...,M}, 


os oy h 
and xy = žb’ Yy = Yje S is the space of continuous functions 
which are bilinear on each element. The test functions Ų 4 ry) 


are tensor products of the one-dimensional test functions 
displayed in Fig. 4.1 - see Heinrich et al (1977) for details. 


(a) (b) 


Fig. 4.1 Test functions in one dimension given by Heinzach et al 
(1977) at mesh Péclet number (a) bh/a = 5, 
(b) bh/a = 50. 
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For each test function vi E ™, we approximate the self- 
adjoint boundary value problem (3.6) by a Galerkin formulation 
to find Z,€ = such that 

<¥z,,Vo,> = <Vx( (b/a) Y). >Y € S (4.1) 
Vi AL i b, a) > k al . 


where 3 is a piecewise bilinear finite element trial space on 
the A Sie of p i characterised by an element size h. In our 

computations we have taken fhe h/l0. An approximation Zi to Zi 
is thus constructed for each Ųų i € Th. Each entry in the matrix 


A is thence constructed using (3.8). The matrices B and C are 
constructed using (3.1) and (2.13) respectively. 


The optimal constant aan” in (2.9) is obtained 


numerically by application of a conjugate gradient algorithm 
to determine 


anne ( view i (4.2) 
sid = sup, mm ai . . 
1 Gent Pay 


To the extent that (4.1) is an approximation to (3.6), our 
analysis is, of course, not exact. 


The Galerkin formulation is analysed in the same way simply 
by replacing vy by ti in (4.1). 


Two different domains with two different convective velocity 
fields are considered. Firstly 2 is the unit square and the 
velocity field has constant magnitude |p| = 1 and constant 
direction inclined at an angle @ to the x-axis. Three values 


of 0 are considered : 0, 21.8° and 45°. This corresponds to 
the test problem of Raithby (1976). Secondly Q is the a 


illustrated in Fig. 4. A the velocity field b = (bys b, Es has 
components be 2y(1=x" ) and by = -2x(1-y" ). This PTE 
to the test problem proposed by Hutton (1981). 
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+1 


=A 0 +1 x 


Fig. 4.2 Domain and streamlines in test problem of Hutton 
(1981). 


Our results are displayed in Table 4.1 and in Table 4.2. 


1.0000 


2.8833 

8.3276 
15.023 24.710 
17.117 45.848 
19.631 405.96 
19.928 3183.6 


Table 4.2 Optimal error constants for the problem of Hutton 
(1981). 
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A PSEUDO-SPECTRAL SOLUTION OF VORTICITY-STREAM FUNCTION 
EQUATIONS USING THE INFLUENCE MATRIX TECHNIQUE 


J.M. Vanel and R. Peyret 
(Département de Mathématiques, Université de Nice, France) 


and 


P. Bontoux 
(I.M.F.M., Université d'Aix-Marseille 1l, France) 


l. INTRODUCTION 


In this paper we present a fast and accurate method for 
computing unsteađy two-dimensional incompressible flows. The 
method makes use of the vorticity and the stream function as 
dependent variables approximated by means of Chebyshev 
polynomial expansions. When using these variables the delicate 
question is the derivation of proper boundary conditions for 
the vorticity. The treatment of these conditions must preserve 
accuracy, stability and should avoid the use of an iterative 
solution procedure. The method developed here makes use of the 
influence matrix technique. This technique which has been 
found successful for the velocity-pressure equations (Kleiser 
and Schumann, 1980; Le Quéré and Alziary de Roquefort, 1982) 
has been applied until now to vorticity-stream function 
equations only in cases which reduce to one-dimensional 
problems (Tuckerman, 1983; Dennis and Quartapelle, 1983). 

Our purpose is to give a two-dimensional formulation of the 
method, to point out the theoretical and numerical difficulties 
associated with it, and finally to present numerical results 
illustrating the properties of the method. 


2. EQUATIONS AND NUMERICAL APPROXIMATION 


The Navier-Stokes equations within the vorticity-stream 
funetion formulation are 


OW Lhe take be y 
Seon Re Vw = - V.Vw + £ = F(p,w;£) (2.1) 


vey +0 =0 (2.2) 
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where the stream function y and the vorticity w are connected 
to the velocity v by 


[J 
Fem (Ht, -#) oe R-B. (2.3) 


These equations are solved in D (~l< x,y<l ) with the 
boundary conditions on I = 3D. 


veg, =h (= unit normal to T). (2.4a,b) 


The initial condition at t = O is 


V= 0°, then v = 0%. (2.5) 


In (2.1), Re is the Reynolds number and f is a given forcing 
term. 


The time-discretization is a combination of the 2nd-order 
Backward Euler scheme for the viscous term with an Adams- 
Bashforth type evaluation of the convective term, i.e: 


1 n+l n n-1 l „2 ntl n -l 
mee = ww) = A = 2F° - F" (2.6) 


2 n+l 4 untl 


vy =o (2.7) 


where n refers to the time Pere nit. This second order scheme 


is preferred to the usual Crank-Nicolson/Adams-Bashforth 
scheme for its better stability properties: 


(1) the higher Fourier or Chebyshev modes are better damped, 


(2) for moderate Re the critical time-step is larger, and for 
small Re the scheme is unconditionally stable, as found by 
Zakaria (1985) for the advection-diffusion equation solved with 
a Collocation-Chebyshev method. 


So, at each time-step the following Stokes-type problem has to 
be solved: 


vu - w =G (2.8) 
2 in D 
Vý+w=0 (2.9) 
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veg, tenon (2.10a,) 


where w and are written for Pije and eT, respectively. 


The approximation of (2.8) - (2.10) makes use of the 
Chebyshev polynomial expansion: 


w N M ee 
z b ted rt T(x) 7, (y) (2.11) 


y] n=o m=o vn 


The Chebyshev coefficients of G in (2.8) are calculated by the 
pseudo-spectral technique which consists in performing the 
products in the physical space and derivations in the spectral 
space, these two spaces being connected through a FFT 
algorithm. 


The unknowns ĉa Si (A m ore obtained through the Tau-Method 
' , 


(see Gottlieb-Orszag, 1977) associated with the matrix 
diagonalization technique of Haidvogel and Zang (1979) for the 
solution of the systems. 


The classical difficulty associated with the formulation 
lies in the determination of boundary conditions for the 


vorticity. The technique which consists in using w = - vy on 
the boundary has been employed in association with a Tau- 
Chebyshev method by Elie et al. (1983). We present here another 
solution to this problem which makes use of the influence matrix 
method. The advantage of this method lies in the strong 
coupling at the same time-level (n + 1) between w and y on the 
boundary, leading to good accuracy in time and stability 
without requiring any iterative procedure. 


3. INFLUENCE MATRIX METHOD 
3.1 Description of the Method 


Thanks to the linearity of the problem, the solution of (2.8)- 
(2.10) can be written as 


w w J v 
=f E y (3.1.1) 
(7 yj i=l Yi 
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In this formula, (@,)) is the solution of Problem A: 


Vo-o = G in D 


| g E A 
w = D (arbitrary) on T 
Wea fone! 
(o i iie EE AE y 
weg onT . 


In the summation in the equation (3.1.1) the number J is 
connected with the number of collocation points Ej nj) on T. 


These w» Yi) are solutions of Problem B: 


2 ; Oy: seth 
Vu, - Aw =o in D Wav ir OP in D 


2 = r 
Ge 845 on T ag) on 


where j refers to the points (jn) and Sij is the Kronecker 
symbol . Then the J coefficients Yi are determined such that 


Ņ satisfies the boundary conditions (2.10b), i.e. 


ms T. ow 
a) -a -(@) +2. @) 
= = Yi (3s Led) 
av 5 j av 5 ie! i \ov j 


This equation, written for all j, yields an algebraic system 
for the y,'s. The matrix A = [ (y, /3v) 4] of this system is the 


"influence matrix". 


Note that: (1) Problem B is independent of the time- 
level considered. Consequently, the calculation of Oar A 


and its inverse D can be done once and for all before starting 


the time-integration. (2) It is more economical to define the 
boundary values of w 


w =w +y, oT 3.1.6) 
T Wake WR ap E f 


and to solve a problem A [with (3.1.6) as boundary condition 
for w] rather than to store the J fields wY 1) anā-to perform 


the linear combination in | E Fr an S 
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3.2 Problem of corners and determination of J 


The presence of corners in the domain D leads to 
difficulties. Two methods have been considered to surmount 
these. 


Firstly, we assume that f,g and h are such that Ņ € Cso 
that w at a corner C can be expressed by: 


2 
Wig = EER E (3.2.1) 


which is known from the condition (2.10a). 


So, in Method 1, we prescribe 


4 2 
e= isje“ Tle (3.2.2) 


ile =0 (3.2.3) 


and the equation (3.1.5) is written at each collocation point 
on T except the corners; hence J = 2(N + M -2). 


In Method 2, f,g and h are assumed such that vy = c so 
that the equation (2.8) is satisfied up to the corner C. Now, 
together with (3.2.2) and (3.2.3), we can prescribe 


Vat =- Yje + Gje (3.2.4) 
vu lc =0 (3.2.5) 


Because of these additional conditions, the equation (3.1.5) 
must be written at J = 2(N +M - 4) collocation points only. 


At the four remaining points, the values of w = Bg and vi are 


determined se that (3.2.4) and (3.2.5) are satisfied. Here, 

we have chosen these special points S adjacent to the corners 
(Fig. 1), that is to say in the regions where the density of 

collocation points is high. 
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Fig. 1 


3.3 Difficulty associated with the Tau-Method 


The use of the Tau-Method to solve problem B for Vy leads 


to a difficulty arising because the last weighted residuals, 
corresponding to the higher modes, are not considered (they are 
replaced by the boundary conditions). So it appears that there 


exists a linear combination w of the Wy such that the only non- 


zero coefficients have indices n =N - 1 or N, or m=M-1 or 
M, precisely those coefficients which are not taken into 
account in the calculation of the corresponding stream function. 
Consequently we have a numerical solution y = O and the matrix 
A is not invertible. This difficulty can be surmounted if the 
whole spectrum of the vorticity w, is considered when solving 
the equation for Ņ,. This is done by increasing the number of 
polynomials in the expansion (2.11) for wy, which goes up to 

N +2, M+ 2 now. 


We are thus led to the problem of expressing 3/9x and 
dy/dy, defined by (N + 3)(M + 3) coefficients, at the usual 
(N + )(M + 1) collocation points (cos nt/N,cos mt/M). ‘The 
evaluation of G is done by simply neglecting the last two 
coefficients in either direction. On the boundary, two 
variants have been considered: 


Method 1, 2: all the coefficients of 3W/dv are taken into 
account. 


Method 1',2': the two last modes of 00/dv are neglected. 


The numerical results have shown that 1 is much more accurate 
than 1', and that 2' is slightly better than 2. 
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4, NUMERICAL RESULTS 

All the calculations reported here have been made with N = M. 

4.1 mxact unsteady solution 

We consider the solution 
27 - 1 + sin 2mt 


2 2 
Yo Ug Wg = FLY”) 
20 


2 ex(y-)) 
(4.1.1) 


which defines f, g and h. Here we compare the accuracy of 
various time-discretization schemes: 


(1) 15t-order Backward Euler/Adams-Bashforth 


(2) 274_order Backward Euler/Adams-Bashforth type [ Eqs. 
(2.6) - (2.7)]. 


(3) Crank-Nicolson/Adams-Bashforth. 


The calculations have been made with Re = 1, N = 20 and 
O.lsA\ ts 0.005. With these values of N and At, the leading part 
of the error comes from the time-discretization. Fig. 2 shows 
the normalized mean quadratic error: 


E = = I 8 ol. 
Ey aty Wore n2/ ex 22} (451:2) 


The presence of three levels in time in the schemes 
necessitates starting-up procedures with modified schemes 
which, in the present calculations, are first order only. 
Hence, the maximum in (4.1.2) is taken when_the effect of the 
initialization has disappeared: the error Eg is then periodic 


in time. From Fig. 2, it can be seen that scheme (2) is 
slightly more accurate than scheme (3), at least for the 
special solution computed here. 
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Fig. 2 Error E, 
4.2 Exact steady solution 


In order to compare the accuracy of methods 1 and 2' we 
have considered the steady solution ve = Yor using the scheme 


(2.6) - (2.7) with Re = 1. From Table 1, it is seen that 
method 2' is more accurate than method 1 for small N. The 
accuracy is comparable with some advantage for Method 1 when 
N is larger. 


N lo 12 16 20 


6 11 


Method 1 1.223x10 ` 5.828xl0 ° 9.179x10 22 4.045x107 


A s, ~ -l 
Method 2' 2.928x10° 3.507x10" 1.723x10 1° 3.573x10 1° 
e ERSE EO A E aN A e 


Table 1: Error E 
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4.3 Steady Cavity flow 


Finally, the method has been used to calculate the steady 
solution of (2.1) - (2.2) with f = O, Re = 50 and with the 
boundary conditions: y = O on the whole boundary; ðọŅ/ðy = 


- (l - Eo at y = l, 3ọ/ðv = O elsewhere; and with the initial 


condition Ve = O and therefore wW = o. 


Fig. 3, which shows the critical time-step; makes clear the 
good stability properties of the method. 


0.50 

0.45 [ 

0.40 SN 
0.35 


0.30 \ 


\ 
0.25 N 


0.20 be} 


0.15 ae 


Fig. 3 Critical time-step 


Table 2 gives some results. Here, the variables have been 
redefined by using the side L of the square as reference 
length, so that Os X, `Y $ 1,¥ = w/2. Q = 2w and Re, = 100. 


refer to (Peyret and Taylor, 1983) for results given by other 
methods. Because of the strong variation of 2 on the upper 
side Y = 1, the consideration of the maximum on collocation 
points only gives a bad idea of the convergence of the method 
with respect to N. Since the spectral approximation gives the 
solution at every X E [0,1], we have also listed in the last 
colum of Table 2 the maximum value of 2(X,1) on 201 equally 


spaced points. 
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Table 2. Results for cavity flow, Re, = 100 


Method nN max|y| "©? maxlacx,2)| ©) max|a¢x,2) |‘) 
pseudo-spectral 16 0.083139 13.4235 13.5108 
1 20 0.082692 13.1519 13.4357 
24 0.083315 13.4149 13.4357 
32 0.083403 13.3418 13.4462 
pseudo-spectral 16 0.083158 13.3475 13.4527 
2 20 0.082695 13.1790 13.4506 
24 0.083315 13.4260 13.4490 
32 0.083403 13.3441 13.4441 
ct ke ES SU RI YT eA, A 
pseudo-spectral 16 0.083057 13.3574 13.4485 
(a) 20 0.082545 13.1810 13.4453 
24 0.083201 13.4315 13.4522 
32 0.083265 13.3496 13.4517 
ph ee ie a rr 
Collocation (V,p) 16 0.083686 13.3574 13.4428 
(b) 32 0.083685 13.3422 13.4445 
SEE ee RENEE CER F aly al at Sn 
Hermitian (c) 20 0.0835 13.31 = 
ee ET OE A 
Finite-Diff. (d) 20 9.0829 13.14 = 


Ce EE eS 


(a) Elie et al. (1983), 

(b) Ouazzani and Peyret (1983), 

(c) 4th-order, Bontoux et al. (1978), 

(d) 2nd-order, Peyret and Taylor (1983), p. 205, run 16, 
(e) taken on collocation points, 


(£) taken on 201 equally spaced points and obtained at X=0.62. 
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Fig. 4 shows the evolution of 2(X,1). On the scale of the 
figure the difference between N = 16 and N = 32 cannot be 
distinguished. Fig. 5 is a blow-up of a very small region near 
X = 0. This provides an illustration of the high accuracy of 
Chebyshev approximation near the boundaries. Also, it is 
interesting to note the good agreement of the present results 
with those obtained by a Collocation - Chebyshev - Artificial 
Compressibility method (Quazzani and Peyret, 1983) for the 
velocity-pressure equations, the latter method needing no 
special treatment at a corner. 


The CPU time (CRAY 1S) is 0.03s/time-step for N = 20 and 
0.078s/time-step for N = 32. The preprocessing is roughly 
equivalent to 4N time-steps. 


14 
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Fig. 4 Vorticity Q(X, 1) 
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Fig. 5 Vorticity Q(X, 1) near X = 0 
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THE USE OF CHEBYCHEV POLYNOMIALS IN SPECTRAL 
FLUID DYNAMIC SIMULATIONS 


P.R. Voke and H.M. Tsai 
(The Turbulence Unit, Queen Mary College) 
and 
M.W. Collins 
(Thermofluids Engineering Research Centre, The City University) 


l. BACKGROUND 


This paper reports certain results of a mathematical nature 
arising from the use of two spectral (Fourier-Fourier-Chebychev) 
simulation codes for plane Poiseville flow, both of which have 
been used for full simulation (no modelling) and large-eddy 
simulation (incorporating a subgrid scale model). 


Fluid dynamic simulations take advantage of the fact that we 
know the equations of motions governing turbulence, and that we 
have large computing machines at our disposal. By choosing a 
computational box significantly larger than the expected 
correlation length of the turbulence under study, we can 
reproduce convincingly the properties of homogeneous, 
isentropic turbulence at low Reynolds numbers. Periodic 
boundary conditions are imposed on the box, and some random 
disturbance is set up, which is allowed to evolve in time 
according to the Navier-Stokes equations. The equations are 
treated in "k-space", that is, in terms of the Fourier 
coefficients. The nonlinear terms are computed by Fourier 
transformation into "x-space", giving values of velocity fields 
at a sequence of discrete mesh points at which the nonlinear 
product of fields are evaluated. Both rigorous and approximate 
methods are known for dealing with the aliasing errors that 
arise from this process when the product is retransformed to 


k-space. 


There are severe limitations on the use of such full 
simulations of the Navier Stokes equations. The finite number 
of Fourier modes that can be used in a real digital simulation 
Means that the Reynolds number must be very low. (The 
truncated Fourier expansion must provide a basis spanning the 
attractor of the flow in function space, which therefore needs 
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to have a rather small dimension.) As the Reynolds number 
increases it rapidly becomes impossible simultaneously to 
resolve all the important scales present. The only hope of 
performing simulations at higher Reynolds numbers appears to be 
some form of large eddy simulation, defined as a simulation in 
which only a limited range of scales is resolved, and eddies 
below the resolution of the grid are modelled. For a review of 
the field, see Voke and Collins (1983). 


The introduction of one or more boundary walls changes the 
flows fundamentally. Even at Reynolds numbers just above 
transition, the rates of change of many important dynamical 
quantities become large as the wall is approached. In the 
context of a finite-difference (or mixed spectral finite- 
difference) computation, it is usual to employ mesh stretching 
in the direction perpendicular to the wall in order to achieve 
adequate numerical resolution of the dynamics, (Moin and Kim 
1982). At higher Reynolds numbers this becomes unwieldy; 
certain large-eddy simulations attempt to model the wall layer 
dynamics as well as subgrid turbulence, (Schumann 1975). 


2. CHEBYCHEV POLYNOMIALS 


Chebychev polynomial expansions represent an alternative to 
finite differences with mesh stretching. Orszag and Kells 
(1980) have used a fully spectral simulation to perform full 
simulations of plane Poiseuille flow using Chebychev polynomials 
in one direction and Fourier expansions in the other two 
directions. 


The attraction of Chebychev expansions is that they provide 
a spectral representation consistent with arbitrary nonperiodic 
boundary conditions, avoiding the Gibbs phenomenon. In addition 
they are closely related to cosine expansions, and Chebychev 
transformation may be performed as efficiently as cosine 
transformation, using fast Fourier transform (FFT) algorithms. 
The natural collocation points for the computation of nonlinear 
terms lie on a cosine distribution between the walls, reflecting 
the underlying weighting used to define the polynomials. 


3. THE FDS SIMULATION CODE 


The first of our codes, FDS, solves the fully coupled 
incompressible Navier-Stokes equations for the primitive 


variables u, (velocity) and p (pressure/density), which may be 
written in the form: 


aa leipe 2 
ti a5 aju +v9 u tF (3.1) 


3u, =O (3.2) 
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Fi is some forcing term such as a unit pressure gradient. The 


equations take a similar form if nondimensionalised using the 
wall shear velocity and half channel width H, with v then 
representing the reciprocal of the Reynolds number based on 
these quantities. We Fourier transform these equations in the 


x and Xp directions, (the Fourier wavenumbers are ki and k3) 


and expand in a finite Chebychev series in the X3 direction 


perpendicular to the walls. Simple order finite differencing 
is used in time, explicit for the inertial term and implicit 
for the others, giving a set of equations for quantities 

+ + 
u and p at the advanced time in terms of explicitly 


calculated quantities R 


i 
2 + + 92 2 2 + 
ie +ik,P SAE -k k, )u, = Ry i=1,2 (3.3) 
ox’ 
3 
2 $ eae KASIS) DA VAAR (3.4) 
Tae eT Seta ENEE T aE NE) ; 
3 ox 
3 
+ 
du 
PDs. a A 
iku) + ik,u, + Ix o (3.5) 


The derivatives with respect to x3 have been left in the 


x-space form here, but are computed by the well known recursion 
relation for the Chebychev coefficients of a derivative 
function (Fox and Parker, 1968) 


u' (k3) =u' (k,+2) + 2 (k3+1) u(k+1) (3.6) 


This recursive algorithm is inconvenient to computes explicitly 
on vector processors (it is best done on a large number of 
coefficient sequences in parallel). The equations are 
integrated twice with respect to x, to obtain the standard form 
of the left hand side. The matricés are given explicitly in 
Voke and Collins (1983 and 1985). The numerical integrations 
should be the precise algorithmic inverses of the corresponding 
differentiation algorithms, if the left hand side matrices are 
to be fully consistent with the right hand side terms. 


A graphical technique has been used to assist in the 
derivation of the matrices for the complex set of equations 
(3.4-3.6). The derivative algorithm for a sequence of Chebychev 
coefficients u(k3) (k=0,N3) is represented by figure 1, in 
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which each horizontal arrow represents simple addition, and each 
diagonal arrow addition of the term multiplied by the 
appropriate 2 (k3 +1) value. 


k=0 


= oon ae ee 


Lf 2k) 


et 
© SET T0 ZERO 
©) INFORMATION LOST 


Fig. 1 Information flow graph; differentiation of Chebychev 
coefficient sequence. 


The reverse algorithm, used for setting up the matrix for a 
first order differential equation, is shown in figure 2a. It 
shows that one degree of freedom is unaccounted for; this is 


fixed by the boundary conditions of the problem. Double 
integration is shown in figure 2b, and requires two conditions. 


eC 


4 “VY (k+1) 
N % (k +1) 


© PRESET TO ZERO 
© ARBITRARY 


RE OSGOCE Ey 
: OOOO 


Fig. 2 Information flow graphs; integration of Chebychev 


coefficient sequence; (a) single integration (b) double 
integration. 


k=0 


CHEBYCHEV POLYNOMIALS IN SPECTRAL SIMULATIONS 481 


Each distinct route from a point on the top line to one on the 
bottom line represents a contribution to the appropriate 
matrix element, calculated from the product of the factors 
associated with the arrows in the path. This graphical 
technique proved useful for setting up the correct matrix 
elements for the second order system, but has also been used to 
set up the matrix for a fourth order problem. Consistent, 
nonsingular matrices are obtained for the second order system 


for all values of ky and ky apart from k)=k,=0, which requires 


a single additional element to fix the mean pressure. 


A further computational technique is worthy of mention. The 
equations (3.3) to (3.5) have complex coefficents, but by 


maltiplying 3.3 by i (test); equations for the unknowns iu, 
iu, uz and p with purely real coefficents result. The 


matrices are banded, with a bandwidth of 20, but not all the 
diagonals are present. They are roughly equivalent to 

hexadiagonal matrices. Since the pattern of nonzero elements 
is the same for various values of ki and ky (unless either is 


zero), sparse matrix techniques are used to factorise and 
solve the system. 


4. THE CHANEL SIMULATION CODE 


This code was written independently by Antonopoulos-Domis 
(1983), and uses time stepping and Fourier-Chebychev expansion 
methods similar to those described above. However, it is based 
on a derived equation for Uz, obtained by taking the Laplacian 


2 
(22) of the u3 momentum equation, and substituting for ð p 


from the normal Poisson equation for pressure. This gives an 


equation for y=27u, of the form 


2 2 
PPA a AE A A ee US (4.1) 
where S is a nonlinear term computable entirely in terms of 
velocity components. Equation (4.1) is solved three times for 
combinations of zero or unit boundary conditions on y. Three 
different u. solutions result, (the Poisson equation is solved 


with u,-0 on the boundary), and a linear combination found for 


which 9,u,-0 at the boundary. 


The CHANEL code then finds the pressure at the advanced 
time algebraically, and solves for u and Uy from (3.1), 


incorporating the pressure gradients into the right hand sides. 
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In all cases the viscous terms are treated implicitly in time. 
Unfortunately Antonopoulos—Domis overlooked a still more 
efficient way to solve for u and Uys using the equation for 


the vorticity component W3r with boundary condition w70: 


2 
=Q2,+Vv9 
aw vow 


x = - 4.2 
k 4 37 8 gt (8yu, 3 u, )/2 (4.2) 


Neel bel 


This equation is simply one component of the curl of 
equations (3.1), and a, is the third component of the curl of 


the inertial term in (3.1). Once w3 and uz are known at the 


advanced time, it is trivial algebra to find u and uz in 


k-space from (4.2b) and (3.3). This method is to be 
incorporated into the CHANEL code shortly. No pressure 
solution is needed, 


In spite of the necessity for multiple solutions, the method 
proves to be much more efficient than that used by FDS, because 
of the decoupling of the unknown fields. 


5. RESULTS 


Both codes have been used for full simulation of very low 
Reynolds number flows, around transition ‘FDS) or buoyancy- 
driven (a code derived from CHANEL), but have also been used 
for large-eddy simulations of low Reynolds number turbulence. 
The Reynolds numbers studied in the simulations are in the 
range Re=1000 to 1000Q, based on maximum velocity and channel 


D + 
half width, or H =30 to 300 based on shear velocity. 

The full simulations around Re=1000 show transition to 
turbulence in the presence of finite disturbances between 


Re=1000 and 1100 (Voke and Collins 1985). 16 Fourier modes 
were used in each of the x and X, directions, with 17 


Chebychev polynomials used in the x3 direction. Sustained 


turbulence occurred at H*=40 (Re=1600). In spite of the low 
spectral resolution, there is no evidence of a pile-up of energy 


in the top Fourier modes for ut < 40. 


at higher Reynolds numbers, a subgrid scale model is needed, 
and a relatively crude eddy viscosity model was used. For 


j h + 
simulations at H =300, an apparently sustainable turbulent state 
was found even using 16 x l6 x 17 modes. However, an unphysical 
oscillation with respect to x3 is seen in many important 


parameters, figures 3 and 4. 
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Profiles of streamwise velocity u, averaged over planes 
parallel to the walls, predicted by code FDS. 16 x 16 


Fourier modes. 

Full simulation, H’=40, 17 Chebychev modes. 
t*=tu, /H=5.0. 
mui Large-eddy simulation, H*=194, 33 Chebychev 
modes, ¢*=3.0. 


+ 
-x---- Large-eddy simulation, H =300, 17 Chebychev 


modes, tt=2. 75. 
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Fig. 4 Profiles of turbulent kinetic energy, predicted by code 
FDS. 16 x 16 Fourier modes. 


Full simulation, H’=40, 17 Chebychev modes, 
tt=5.0. 
---x- Large-eddy simulation, H*=300, 17 Chebychev 
modes, tte2.6. 


This type of lateral wiggle is also found by CHANEL in several 
important quantities when using the same number of modes, 
figures 5, 6 and 7. 


F 


TO 0-0 1-0 
Fig. 5 Profiles of r.m.s. intensity Vu, predicted by CHANEL. 


Large-eddy simulation at H*=194, Arbitrary normalisation 
8 x 16 Fourier modes. 17 Chebychev modes. 
----- 16 x 32 Fourier modes. 33 Chebychev modes. 
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Fig. 6 Profiles of r.m.s. intensity fev >, predicted by CHANEL. 


Large-eddy simulation at H*=194. Arbitrary normalisation 
8 x 16 Fourier modes, 17 Chebychev modes. 
Sas 16 x 32 Fourier modes. 33 Chebychev modes. 


-1-0 0-0 10 
Fig. 7 Profiles of resolved Reynolds stress <uw>; CHANEL. 


Large-eddy simulation at H’=194. Arbitrary normalisation 
8 x 16 Fourier modes, 17 Chebychev modes. 
ek 16 x 32 Fourier modes. 33 Chebychev modes. 


The origin of these wiggles has been tracked down to a lack 
of adequate resolution of the wall layers. The large-eddy 
simulations are performed by incorporating into the equations 
given in the preceding sections an additional term that is 
the divergence of a Reynolds stress. This we term the "subgrid 
acceleration". Physically, this acceleration must be zero at 
the walls, and it is well known that components of Reynolds 
stresses are zero and have zero normal derivatives at walls. 
To ensure this, our eddy viscosity models, computed at the 
collocation points in x-space, incorporate an exponential 
damping factor of the Van Driest (1956) type. This results in 
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a spanwise Reynolds stress component 1,,, for instance, that 
peaks close to the wall, descends rapidly to zero, and has zero 


gradient at the wall. 


Figure 8 shows a graph of T31 as predicted by the FDS code, 


using 17 Chebychev polynomials. To give some idea of the 
resolution of the spectral method, the collocation points are 
marked. It is clear that while 33 Chebychev polynomials might 
be adequate to resolve the rapid changes of the stress, 17 
polynomials are unable to do so. 
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Fig. 8 Profile of subgrid Reynolds stress component T31 


predicted by FDS. Large-eddy simulation at H*=300, 
16 x 16 x 17 modes. 


This conclusion is confirmed by figure 9, showing the 
streamwise Component of the subgrid acceleration fhe which 


arises predominantly from 3 This is not equal to zero at 


3713" 
the wall, using 17 polynomials. In the code FDS the value at 
the wall is fixed to zero artificially, resulting in smooth 
Predictions for turbulent flow using 17 or 33 polynomials. 

The code CHANEL does not fix f, to zero at the wall, but obtains 
good agreement with the experimental results of Kreplin and 
Eckelmann (1979) using 33 polynomials. 
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Fig. 9 Profile of subgrid acceleration component £r predicted 


by FDS. Large-eddy simulation at H’=300. 16 x 16 x 17 
modes. 


Related to these constraints, it was found that even 33 
Chebychev polynomials cannot always compute physically realistic 
wall gradients. The wall shear component 33u is miscomputed 


ya 
by up to 10 percent when the values of u, are artificially 


set equal to a at the first five collocation points in initial 


field data. Clearly this will make it difficult to achieve a 
physically realistic mean flow in the statistically stationary 
state. 65 polynomials compute the gradient correctly. 

Our simulations show that Chebychev polynomials can be used 
very successfully in large-eddy simulations, but that some care 
must be taken that critical quantities are computed in ways 
that are physically meaningful, particularly in the near wall 


region. 
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STEADY VISCOUS FLOW PAST A CIRCULAR CYLINDER 


B. Fornberg 
(Exxon Research and Engineering Company, Annandale) 


l. INTRODUCTION 


Viscous flow past a circular cylinder becomes unstable around 
Reynolds number Re = 40. Although this first instability can 
be suppressed by inclusion of symmetry, other instabilities and 
limitations (wall effects etc.) make experimental results for 
the steady flow unreliable past Re = 100 (e.g. Grove et al. 
1964). The only earlier numerical calculation to achieve both 
convergence and accuracy past Re = 120 was by the present 
author (Fornberg, 1980). A change in trends was discovered 
near Re = 300, but a simultaneous growth of numerical errors 
prevented us then from resolving its nature. 


With a numerical technique based on Newton's method and made 
possible by the use of a supercomputer, we have now been able to 
obtain steady (but unstable) solutions up to Re = 600. It is 
found that the wake bubble (region with recirculating flow) 


grows in length approximately linearly with Re throughout this 
whole range. The width is found to increase like Re? up to 


Re = 300 at which point a transition to linear increase with Re 
begins. At the highest Reynolds numbers we reached, the wake 
resembles a pair of translating, uniform vortices, both 
touching the centre line. This observation forces a reassesment 
of earlier models for the structure of high Reynolds number 


wakes (e.g. Smith 1979). 
2. MATHEMATICAL FORMULATION 


With a cylinder of radius 1 and the Reynolds number based on 
the diameter, the steady-state Navier-Stokes equations 
(expressed in streamfunction ¥ and vorticity w) take the form: 
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A¥+w=0 
ul CA (awit O87 
du + > Re te Ee MERE IS 


(b) 


complex z—plane 
z=E+in 


15 20 25 


(2.1) 


(2.2) 


x=173.4 
` y=41.4 


5 


Fig. l a. Region in X-plane to be mapped to a rectangle. 
b. Corresponding region in the Z-plane. 
C. Image near the cylinder of a 241*49 grid. 
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Near the cylinder, all the vorticity is concentrated within a 
region like the one sketched in Figure 1 a. The conformal 


mapping Z = x? + x maps such regions in the physical X = x+ iy 


plane to rectangles in the Z-plane (Figure l b). In the new 
coordinates, the governing equations remain unchanged 


apart from w being replaced by w/|az/ax|? in (2.1). These 
equations were further modified by subtracting out the free 
stream and making a change of variables in the vertical 
direction to increase the resolution along the body surface. 
Figure 1 c shows the image in the X-plane near the cylinder of 
the coarsest grid we employed (241*49 points). 


3. NUMERICAL SOLUTION 


The governing equations were approximated by centred second 
order finite differences at all interior points and boundary 
conditions were implemented. In particular, for the body 
surface, we have two conditions for ¥ (and none for w). Along 
the top boundary, we complement w = O with a relation on Y 
derived from the fact that ¥ satisfies AY = O above the boundary 
with only the decaying modes present (for details, see Fornberg, 
1985). 


The resulting discrete non-linear systems of equations were 
solved by Newton's method. The structure of the Jacobian is 
shown in Figure 2 a. This coefficient matrix is first reduced 
to roughly half its size by eliminating all entries below the 
single diagonal (located in the top right corner block). The 
new structure is shown in Figure 2 b. The sizes of the blocks 
are given both for a general M*N grid and for our 541*109 grid. 


Row pivoted Gaussian elimination was employed (first for the 
oO 
banded part). Each linear system requires 5.1.x 10” arithmetic 


operations. This was found to take 403 seconds on a 2-pipe 

CDC Cyber 205, i.e. a sustained average performance of 127 
Mflops (Million floating point operations per second, 64 bit 
precision) was achieved. The times required for other parts 

of the calculation were negligible (for example the 

generation of each reduced Jacobian required only 0.04 second ). 
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Fig. 2 a. 


Linear system in Newton's method. 
b. 


Structure of the reduced Jacobian. 
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4. RESULTS AND CONCLUSIONS 


Figure 3 shows contour lines for the streamfunction and the 
vorticity at Reynolds numbers 100 to 600. In Figure 4, the 
vorticity fields are illustrated as surface elevations in order 
to highlight the emergence of the large area with nearly 
constant vorticity inside the wake bubble. The length and the 
width of the wake bubbles are shown in Figure 5. Results from 
three different grids are included (241*49, 361*73 and 541*109 
points respectively) in order to give an impression of the 
truncation error level. As we noted in the introduction, the 
length of the wake bubble is found to grow as O(Re) throughout 
the complete range considered. The width starts out as 


(Re?) but becomes also O(Re) following a transition around 


Re = 300. Other results (on pressure, drag etc.) as well as 
further error tests are reported in Fornberg (1985). Figure 6 
illustrates some earlier proposals for a possible limit as 

Re > © and an alternative suggestion based on an extrapolation 
of our present observations. Theoretical work aiming at 
refining such a model includes studies by Peregrine (1985) and 
Smith (1985). 
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Fig. 5 a. Length of the wake bubble. 
b. Width of the wake bubble. 
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Fig. 6 a. Some earlier proposals for the infinite Reynolds 
number limit. 
b. Possible structure at high Reynolds numbers based on 
trends observed in present work. 
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ON THE NUMERICAL SOLUTION OF 3-D INCOMPRESSIBLE FLOW PROBLEMS 


N.S. Wilkes, C.P. Thompson, J.R. Kightley, I.P. Jones and 
A.D. Burns 


(AERE, Harwell) 


1. INTRODUCTION 


This paper discusses some aspects of the numerical solution 
of three-dimensional incompressible flow problems using finite 
difference methods. These problems have traditionally been 
very demanding in computer resources, but developments in hard- 
ware, algorithms and software mean that they can now be solved 
on a routine basis using realistic grids. It is important, 
however, to take full advantage of these developments where 
possible. For example, in order to obtain significant benefits 
from a vector processor such as a CRAY-1 it is important to 
structure the software to enable vectorisation to occur and to 
minimise recursive work. New algorithms, based upon precondi- 
tioned conjugate gradients, for the efficient solution of 
systems of linear equations have been adopted in many fields, 
particularly in oil reservoir simulation. These methods offer 
the potential for being reasonably efficient on both scalar 
and vector processors. They have not, however, been widely 
used for the solution of turbulent flow problems. Other 
algorithmic developments have occurred in the treatment of 
velocity-pressure coupling using the primitive variable formu- 
lation of the Navier-Stokes equations. These points are 
discussed in this paper and illustrated by examples taken from 
the prediction of a realistic three-dimensional flow. 


2. BASIC ALGORITHM 


Most finite difference algorithms used for solving the 
incompressible Navier-Stokes equations in primitive variable 
formulation are based upon the SIMPLE algorithm, see Patankar 
(1980). The differential equations are first linearised and 
then discretised by integration over mass control volumes in 
the usual way. The algorithm then proceeds as follows:- 
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1) Guess the pressure field. 
2) Solve the momentum equations for the velocity field. ; 


3) Correct the velocity so that it satisfies the mass conser- 
vation equation by solving a pressure-correction equation, 
and then update the pressure field. 


4) Solve the transport equations for any additional scalars, 
for example turbulence quantities and temperature. 


(2.1) 


A number of different treatments of step 3 exist, for 
example the SIMPLE algorithm, and its variants SIMPLER 
(Patankar (1980)) and PISO (Issa (1983)). 


In a steady state calculation, one returns to step 2 with 
the latest estimate of the pressure field and cycles through 
steps 2 to 4 until convergence is reached. Because of the 
non~linearity of the system, coupling between dependent 
variables may be more important than a highly accurate solution 
for each individual transport equation. It is important, 
therefore, to do the minimum amount of work in the inner itera- 
tions which will not impair the convergence of the outer itera- 
tions. The robustness of this method, however, will be 
increased by having converged inner iterations, although this 
may be more expensive in computer time, Experience has shown 
that the pressure-correction equation should be solved more 
accurately at every stage in order to ensure that mass contin- 
uity is preserved in the fluid. This is important from the 
point of view of the robustness of the code, as well as 
providing the best rate of convergence of the outer iterations. 


Iteration is also used at each time step of a transient 
problem when implicit methods are used, The PISO algorithm 
for the velocity-pressure coupling, when used in transient 
problems, is a semi-implicit non-iterative method. For this 
algorithm, the correction for mass continuity is such that 
one cycle through (2.1) yields errors in the variables which 
are proportional to the same power of the time step as time 
truncation errors; see Issa (1983). Therefore, given adequate 
convergence of the individual transport equations, it is no 
longer necessary to iterate around (2.1), but more work must 
be done in the linear algebra of that one cycle. 
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The calculation may be divided into two phases:- 
a) the calculation of matrix coefficients, and 
b) the solution of individual discretised transport equations. 


If the domain of computation has a regular structure, which can 
be achieved by padding out and including dummy equations at 

the location of internal solid regions, a high degree of 
vectorisation may be obtained for the calculation of the matrix 
coefficients. Boundary conditions can be vectorised with care, 
including those for logarithmic boundary layers. Hybrid diffe- 
rencing may also be coded in a vectorisable form. 


Hence 100% of the coefficient calculation has been written 
as vectorised code, resulting in a ratio of overall execution 
times in scalar and vector modes of between 3:1 and 4:1 on a 
CRAY-1. Traditionally, somewhere between 30 and 80% of the CPU 
time spent in solving these problems has been spent in doing 
the linear algebra. Because of the high degree of vectorisa- 
tion of the coefficient calculation, the linear algebra will 
take a disproportionate part of the computations unless very 
efficient techniques are used. 


In the next sections, therefore, we examine efficient 
methods for solving the linear equations which may exploit 
their particular form and will vectorise effectively. Many of 
these algorithmic developments are highly effective on scalar 
computers as well. Since, as has already been seen, a large 
proportion of the work is devoted to solving the pressure- 
correction equation, its efficient solution is of particular 
importance and so will be considered separately. 


3. PRESSURE-CORRECTION EQUATION 


It is usual practice to ensure that discretisation of each 
of the transport equations results in a coefficient matrix 
which is diagonally dominant and hence positive definite. 
This is not quite true of the pressure-correction equation in 
an incompressible calculation, since the matrix is singular, 
corresponding to the freedom to add an arbitrary constant to 
the pressure. Another unusual feature of this particular 
equation is that its coefficient matrix is symmetric, which 
arises because the differential equation of which it is a 
discrete analogue contains no first order derivative terms. 
Such a symmetric positive semi-definite system is ideal for 
solution by the conjugate gradient (CG) method. This section 
describes the experience gained with the implementation of this 
method in some three-dimensional flow codes. 
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Since discretisations of partial differential equations 
typically yield matrices with a poor eigenvalue structure 
(evenly spread over a wide range), the matrix must be pre- 
conditioned. This is equivalent to solving the original 
system by means of the Preconditioned CG algorithm (PCG), which 
resembles classical CG apart from the additional calculation 


of wl, (see Concus, Golub and O'Leary (1976)). M is the 
preconditioning matrix, which is in some sense 'close' to the 
coefficient matrix, A. The PCG algorithm will vectorise 
straight-forwardly, except perhaps for this preconditioning 
phase. The correct choice of preconditioning is therefore 
important. A balance is sought in the amount of recursive 
work entailed by this phase of the calculation. A complicated 
but accurate preconditioning will make each iteration more 
costly, yet may give a better convergence rate so fewer 
iterations are required. 


Those preconditionings which have been employed include 
diagonal scaling, incomplete Cholesky factorisation (the ICCG 
algorithm), truncated incomplete Cholesky introduced by van der 
Vorst (1982), and the Nested Factorisation form of block pre- 
conditioning of Appleyard et al. (1981, 1983). 


Detailed testing of these algorithms on realistic flow 
problems has shown that there is no single best algorithm for 
all problems on all computers, which is one good reason for 
maintaining a range of preconditionings. Indeed, a powerful 
feature of the PCG method is its flexibility. ICCG is found 
to be most useful, in that it is efficient on both scalar and 
vector machines, reliable on the test problems so far studied, 
and can handle the singularity of the system provided a 
‘column sum constraint' is not enforced. 


One consideration in selecting the right preconditioning 
to use is the amount of storage available. On a computer such 
as a CRAY-1 which does not have virtual memory, this is a real 
concern when solving problems on meshes with over 40000 grid 
points (see section 5). The symmetry of the coefficient 
matrix can be exploited by storing, say, only the lower 
triangle of it. Enough workspace is then freed for the CG 
part of the algorithm. Each of the above preconditionings is 
economical on storage provided no fill-in is allowed in the 
incomplete factorisations. 


An important feature of fluid flow calculations is that 
often we have no a priori knowledge of the relative sizes of 
terms in the matrix. Of the above-mentioned preconditionings, 
truncated incomplete Cholesky and Nested Factorisation are 
dependent on the order in which dimensions are scanned, which 
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determines which terms lie close to the diagonal of the matrix 
and which are far away. Examples are reported by Kightley 
and Jones (1985) of problems which will converge using such 
anisotropic preconditionings with one ordering of the unknowns 
but diverge for another ordering. For this reason, an 
isotropic preconditioning such as incomplete Cholesky is 
preferred. 


Details of all these algorithms and a comparison of their 
relative performance on model problems and on matrices arising 
from realistic flow problems may be found in Kightley and 
Jones (1985) . 


These PCG solvers have been implemented for the pressure- 
correction equation in an existing 3-D turbulent flow code. 
The cost of transient flow predictions has been reduced 
substantially - by up to 25% of the cpu time - on both a 
CRAY-1 and scalar computers, as compared with Stone's Strongly 
Implicit Procedure (SIP) with a fixed parameter, which was the 
method previously used. Table 1 shows the number of iterations 
of SIP and ICCG required to reduce the residual norm in the 
pressure-correction equation to a preset level, for each of 
the two pressure-correction steps of the PISO algorithm, The 
first few time steps of a typical turbulent flow in cylindrical 
geometry are illustrated. The work of ICCG is only very 
slightly greater than that of SIP, so relative cpu times can 
easily be deduced from the figures, Greater reliability was 
also obtained, as demonstrated by smaller deviations in the 
numbers of PCG iterations required. The numbers of iterations 
required for SIP could be reduced by using a cyclically varying 
parameter. The amount of work per iteration would increase 
significantly however, since the matrix has to be re- 
factorised. The best choice of parameter to use is also not 
known. 


4. OTHER TRANSPORT EQUATIONS 


Efficient solution of the equations for transport of all the 
other dependent variables is often less critical to the overall 
performance of a flow prediction code, because less accuracy 
is required in the solution due to the semi-implicit treatment 
of non-linear terms. Greater diagonal dominance, enhanced by 
under-relaxation, also makes the solution correspondingly 
easier. Standard solution methods are point- or line- 
relaxation schemes, and incomplete factorisation techniques 
such as Stone's method and ADI. Latterly, some of these have 
been coupled with multigrid acceleration. Line relaxation is 
widely used to good effect in 2-D calculations and can be 
applied in 3-D also. Assembler-coding of the tridiagonal 
solver on the CRAY-1 can halve the cpu time in this essentially 
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Table 1 


Iterations on first and second pressure-correction steps of 
PISO algorithm. Grid size is 16x16x16. 


ERE SIP PCG 
step lst step 2nd step lst step 2nd step 
3 24 8 14 12 
4 16 7 17 7 
5 33 7 15 6 
6 12 7 16 11 
7 23 10 16 12 
8 34 18 17 14 
9 32 19 17 15 
lo 36 25 17 15 


scalar part of the computation, and has been found to be quite 
effective on many problems. 


A number of adaptations of CG techniques to the unsymmetric 
case have been developed. These include solving the Normal 


equations (aa) x = a'b, the bi-CG algorithm, and the Minimal 
Residual algorithm, see Appleyard et al. (1981 , 1983). A 
recent innovation is the CG-squared algorithm of Sonneveld, 
Wesseling and de Zeeuw (1983), communicated by Wesseling 
(1985). This has approximately the same cost per iteration 
as bi-CG and up to twice the rate of convergence. In view of 
the diagonal dominance of the systems to be solved by this 
method, diagonal scaling of the matrix may be sufficient pre- 
conditioning in many cases. Vectorisation is then very good, 
with vector lengths of the order of the number of unknowns 
which enhances efficiency and is necessary on some vector 
computers. It also is economical in storage, taking the same 
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amount as bi-CG. This method has been implemented and some 
promising results have been obtained. Further investigations 
are in progress and will be reported in detail later by 
Kightley (1985). 


5. ILLUSTRATIVE EXAMPLE 


The methods described in the foregoing sections have been 
implemented into the code FLOW3D (Jones et al, (1985)) and 
tested in predictions of the flow in the NTH experiment 
(Hjertager and Magnussen (198la)) depicted in Fig. 1. This 
consists of a narrow free air jet, issuing from a square duct 
into a rectangular enclosure, and leaving through a symmetri- 
cally placed square outlet of twice the linear dimensions of 
the inlet. Two symmetry planes were exploited, so that it was 
only necessary to perform computations in a quarter of the flow 
domain. This is an easy operation to specify with FLOW3D. 

The experiment was simulated on several different grids 
ranging from 9x9x9 to 31x31x42 in order to test the sensitivity 
of the results to the number of grid points. 


Two symmetry planes 


L 
==3, 
D = 100mm D o 


di a01 dza 0.2 Inlet velocity = 18.5ms 
Proti D 


Fig. 1 Geometrical configuration of NTH Experiment 
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Details of a number of predictions of the flow in this 
experiment on a 20x20x20 grid are given in Table 2. The 
numbers of iterations and corresponding CPU times to achieve 
convergence are shown for different combinations of the solver 
methods and the velocity-pressure coupling. The convergence 
criterion used for these results is for the mass source 
residual to fall below a fixed preset level, which roughly 
corresponds to the point at which all variables agree in their 
first three significant figures with the answer which would be 
obtained by continuing the calculation to a much lower level 
of the residual. As this is a steady state calculation, a 
tight tolerance on the residual of each linear equation is not 
needed, so instead a fixed algorithm was used. The pressure- 
correction equation was solved using 4 iterations of ICCG or 
6 sweeps of line relaxation with the orientation of the lines 
alternating between the two directions perpendicular to the 
main flow direction. On the unsymmetric equations, 1 sweep 
of line relaxation (LR) or 1 iteration of CG-squared (CGS) was 
performed. The work is roughly equivalent in each case. 


Table 2 


Iteration counts and CPU times with various algorithms for NTH 
problem on a 20x20x20 grid. 


Velocity-Pressure 
coupling 


Number of CPU Time 
Iterations (sec) 


Solvers 
Bae Other 


SIMPLE 


SIMPLE 1095 329.8 
SIMPLE 947 285.6 
PISO 796 324.0 
PISO(r) 794 371.3 


A comparison is also made between the SIMPLE and PISO 
algorithms for solving the velocity-pressure coupling. The 
PISO algorithm requires the coefficients from all the momentum 
equations for the second of its two Pressure-correction steps. 
This is extremely demanding on storage. For large problems it 
is not feasible to hold these coefficients in memory, so they 
must be held out of core. As I/O may incur a cost over and 
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above that of CPU usage, an alternative strategy is to re- 
calculate the coefficients using the latest available estimate 
to the velocities at every stage. The results with out of core 
working are labelled PISO, and with re-calculated coefficients 
PIsO(r). This re-calculation is strictly a modification of the 
true PISO algorithm, but will be a small change if the coeffi- 
cients are not rapidly varying during the course of the itera- 
tion, or for transient problems, in time. 


The differences in CPU times are not large because a compar- 
atively small amount of work is performed in the linear 
algebra on this steady state calculation. On other problems 
that are not reported here, the difference between velocity- 
pressure coupling algorithms has been more marked. The 
improved mass conservation obtained by use of the PISO 
algorithm permits an increase in under-relaxation factors to 
be made, which may result in a large reduction in the number 
of cycles through the dependent variables. As well as a 
possible reduction in CPU time, this has the effect of 
increasing robustness. On this particular problem the choice 
of velocity-pressure coupling algorithm is not so important, 
since the rate of convergence of the turbulence quantities 
restricts the overall convergence rate of the coupled system. 
A two-stage correction of k and £ recommended by Issa (1983) 
has not been used here, but may relax this restriction. 


In Fig. 2, predictions are plotted against experiment for 
the centre-line values of the mean axial velocity W, and the 
isotropic turbulence velocity w' = ¥(2k/3). The grids were 
of dimensions 20x20x27 and 31x31x42, the inlet being resolved 
by 16 and 64 mass control volumes respectively. The grids 
were non-uniform along the z-axis, with fine resolution near 
the inlet, and a fully developed profile was inserted a short 
distance upstream of the inlet, as depicted in Fig. 3. There 
are noticeable discrepancies between the predicted and the 
experimental results; W decreases too rapidly (i.e. the rate 
of spread of the jet is over-predicted) , and the predicted 
rise in k occurs further upstream than is observed experiment- 
ally. Whilst it cannot be completely discounted, numerical 
diffusion is unlikely to be responsible for the disagreement 
between the predictions and the experiment because of the 
agreement between the numerical results with different grid 
sizes. It does appear therefore that the standard k-e model 
is over-predicting the rate of spread of the jet. 


In order to compensate for this over-expansion, Hjertager 
and Magnussen (1981b) extended the potential core of the jet 
downsteam; inlet conditions were moved forward a distance 
consistent with experimental findings. They claimed that it 
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M/S 


Fig. 2 Mean axial and turbulence velocities on symmetry axis. 
Solid line = 31x31x42 grid, broken line 20x20x27 grid. 
Star = mean axial velocity, Diamond = V2k/3 


Fully developed profile Inlet 


Fig. 3 Inlet boundary conditions without lip 
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Fully developed profile Lip 


Fig. 4 Inlet boundary conditions with lip 


M/S 12 


N 
-y 


Fig. 5 Mean axial and turbulence velocities on symmetry axis, 


with modified inlet. 
Solid line = 20x20x20 grid. Broken line = 16x16x11 


grid. Star = mean axial velocity, Diamond = ¥2k/3. 
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was necessary because of the coarseness of their 9x9x9 grid. 

A similar procedure has been carried out here, by using the 
facilities of FLOW3D to insert thin walls in order to attach 

a small lip to the boundary of the inlet, as depicted in Fig. 
4. Coarser grids than in our earlier calculations were used 
in this case (16x16x11 and 20x20x20, uniform in the main flow 
direction, the inlet being resolved by 4 and 16 control volumes 
respectively). As seen in Fig. 5, the results are closer to 
the experiment than the previous results on finer grids. Since 
difficulties have been encountered here on finer grids without 
the lip, the problem is more likely to be due to inadequacies 
in the turbulence model. Indeed, as noted by McGuirk and 

Rodi (1979), a three-dimensional square jet very quickly 
becomes round, and the rate of spread of a round jet is over- 
predicted by 40% with the standard k-e model. This is a 
truly three-dimensional effect, which could not have been 
detected by a 2-D study, or even using a coarse grid in three 
dimensions. McGuirk and Rodi in fact recommend that one of 
the empirical constants in the turbulence model should be 
represented by a functional form which accounts for the transi- 
tion from a rectangular to a round jet. Using a fixed value 
of this constant appropriate to a round jet, recent calcula- 
tions indicate very good agreement with the experiment without 
recourse to any devices such as the lip. These results will 

be discussed in more detail in Burns and Jones (1985). 


6. CONCLUSIONS 


The numerical methods described, especially preconditioned 
conjugate gradients and its variants, have proved very useful 
in incompressible turbulent flow calculations. There is no 
"best' algorithm for all problems on all computers, but with 
a range of methods and flexibility to switch between them a 
reasonably efficient and reliable algorithm can be produced. 
This means that 3-D engineering calculations can be performed 
routinely and economically on reasonably fine grids, and 
predictions from turbulence models for 3-D flows may be 
investigated. 
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HIGH RAYLEIGH NUMBER SOLUTIONS OF THE BUOYANCY DRIVEN CAVITY 
PROBLEM à 
J.S. Rollett and S. Sivaloganathan 
(Oxford University Computing Laboratory) 


l. INTRODUCTION 


The practical importance of fluids to transport or remove 
heat needs little illustration. It ranges from the circulation 
of coolant through a nuclear reactor to the mounting of a 
power transistor on a block with cooling fins. Equally 
apparent is the importance of thermal processes in meteorology 
and various other branches of geophysics. 


We shall be concerned with flows in which temperature 
variations are introđuced by some mechanism independent of the 
flow dynamics; in particular we shall not consider flow induced 
temperature variations arising from adiabatic expansion or 
compression or from viscous dissipation. 


Temperature variations within a convective flow give rise 
to variations in fluid properties (e.g. density and viscosity) . 
An analysis including the full effects is still too complicated 
and some approximation is essential. We use the quations in 
a form known as the Boussinesq approximations, where all 
density variations are neglected except in so far as they give 
rise to a buoyancy force term (see Tritton (1977)). 


In section 2.1, we detail the governing equations and the 
finite element discretisation used. In section 2.2, we 
consider the buoyancy driven cavity test problem (see Jones and 
de Vahl Davis (1983)), we describe the general features of the 
flow, and note that as the Rayleigh number is increased, the 
numerical solution of the problem becomes increasingly 
difficult. Many of the numerical problems encountered are 
associated with the existence of singular solutions and in 
section 2.3, we present an efficient continuation procedure 
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for solution at high Rayleigh numbers and present some brief 
results for the velocity vector fields at various Rayleigh 
numbers. Finally, we conclude in section 3 with some comments 


and appraisal. 
2. THE BUOYANCY DRIVEN CAVITY TEST PROBLEM 


2.1 In the Boussinesq approximations, the equations expressing 
conservation of momentum, mass and energy are as follows - 

here and throughout the summation convention for repeated 
indices has been used (1 < n, m < 2) :- 


dul eu 
aN e Osea (Ra) (Pr) TS 2 =0 xen (2.1) 
m m 
du 
Beate xe (2,2) 
m 
dT 
oT m 
and MTeoR an ae xe Q, (2.3) 


where nies is the modified viscous stress tensor given by: 


ou, du, 
CAL otis Pon + (Pr) EEDE] (2.4) 
Toia si (245) 


and ont is the Kronecker delta. The non-dimensionalisation of 


Mallinson and de Vahl Davis (1977) has been adopted. Hence 
we T and P are dimensionless velocities, temperature and 


pressure, Pr(= v/k) is the Prandtl number and 
3 
Ra(= gaD (TK im TS) /kv) is the Rayleigh number, where v is the 


kinematic viscosity, k the diffusity, a the coefficient of 
volumetric expansion and g the gravitational acceleration. 
D is a characteristic length and T*, Ti are characteristic 


hot and cold temperatures. 
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Buoyancy flows are difficult to treat theoretically 
because of the coupling between the momentum and energy 
equations. The velocity field is governed by the temperature 
field but the temperature field depends through the advection 
of heat on the velocity field. It is impossible to determine 
one independently of the other as can be done in forced 
convection. 


The above equation system is now completed by a set of 
boundary conditions. These can be fairly general in nature, 
but for our purposes they are of the form: 


= = i 
ua (0) on 32 aay UIR, (2.6) 
T= on a2, (2,7) 
dTa =0 on dN. (2.8) 


Clearly a wide range of dynamical behaviours can be expected 
depending on the relative importance of the buoyancy force 
relative to the other terms. We consider one extreme case, 
where the fluid would be at rest in the absence of temperature 
variations, the buoyancy force being the sole cause of motion. 
This is known as free convection or natural convection. 


To construct finite element approximations, we require a 
weak formulation. The procedure is described briefly and can 
be found in more detail in Hutton (1974), Mitchell and Wait 
(1977) and Oden (1972). 


Let S (N) be the space of smooth vector valued functions 
u 
vanishing on the boundary 32, §,,(0) the space of smooth scalar 


functions defined on 2 and let Sp% be the space of the smooth 


scalar functions vanishing on part of the boundary 82, - Then 


we have the following equivalent integral formulation with 
solutions u, p, Ts.tueé S$, (2 res, (2) and (T - T) €S,, (2) 


and 


du ov 
n n $ 
pees ax — + P: my! "ax 6 i= 0 
fo Ya ox, Ya S + Sq Fam ox, axe (Ro r) fo ni ne 


VveS (2) (2.9) 
= u 


516 ROLLETT AND SIVALOGANATHAN 


du 
2m = 2.10 
Jaos . x dx=0 Vr e 5, (a) ( ) 
m 
or ow 
y E RAE A 2) Final ox, Vwe §,(2). (2.11) 


Since no derivatives of pressure appear and at most first 
order derivatives of velocities and temperature, the 
restriction to smooth functions can be relaxed and S (2) 


completed to include vector functions vanishing on 32 with 
square integrable first derivatives, likewise Sp (9) is 


completed to include scalar functions vanishing on part of the 
boundary an, with square integrable first derivatives and 5), (a) 


extended to include all square integrable scalar functions. 
The integral formulation is now the weak formulation of the 
differential problem. 


Finite element approximations. are now constructed in the 
standard manner (see Hutton (1974), Mitchell & Wait (1977)). 
The region 2 is covered with N finite elements a, (l<e<N) 


with a total of M nodes, M, of which do not lie on the boundary 


1 
on and M of which do not lie on the boundary aa, - At each 
node, a global velocity/temperature basis function Wy (x) is 
constructed. For our purposes, the LA (x) are modified 


biquadratics of the serendipity class (see Zienkiewicz (1971) ) 
which take the value 1 at the ith node and zero at all other 


nodes. 


. Each function W, (x) ,l<si< M is continuous in Q and 
since i > M for nodes on 3N, these vanish on 32. Therefore 
these functions can be used to generate an M= dimensional 
subspace of S$, (9) 


s,m -{a: a= z a, mw 


putting i, = 0 for M, < i < M, we can write 
i,m E = 


Wes Sets we CX) Is (2.12) 
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Likewise, using the above argument, we can use W, (x), 
i 


t EE Myr to generate an M,- dimensional subspace of Sr (Q) 


M, 
= = Db ~ 
i 3) SES Deke AE and the boundary condition 
l T T 


of (2.8) is approximated by 


Sp (2) 


à M A 
T= È T, W, (x) 
ih icin 
i=M, +1 
where Ti = Ti for M, <i<¢M, and Ti is the value of T at node 
i) then: 
ie Lhe 
T= 5 T, W(x). (2.13) 
i=l 


Due to certain theoretical considerations (see Girault and 
Raviart (1981)), the pressure approximations should be 
generated by polynomial basis functions of lower degree than 
those defining velocities. Thus a bilinear function x (x) is 


constructed interpolating 1 at the s-th corner node and zero 
at all other corner nodes. ‘The approximating. space 5am is 


then given by: 

~ M ~ 
s (2) ={p:p= = pX (x)}, 
P sl ss 


where the suffix S ranges over only corners nodes 


p= Ep X(x). (2.14) 


Substituting (2.12), (2.13) and (2.14) into (2.9), (2.10) and 
(2.11) the following discrete systems of equations emerges, 
non-linear in the nodal values of velocity and temperature 


a pis gina a, . + (Pr)A, Wee Ce ne: 
fin Pijkm kmt jn (Pr)A j mm?5,n (@Pr)A j mjm “sims 
- D, T, =0 (2.15) 

(Ra) (Pr) ijan j 
= c. u =0 (2.16) 


£ $ Rai 
i,3 is si,m i,m 
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= u T -A,, T =0 (2.17) 
E mac e A 
where : 
ðw ow w, 
Dr et) 
Aa hie Sg is Jim, at w, ax 
ij,mm a a ox, ijk,m Q i ox, k 
f ow, f 
c e eNO SDs { ww, dx ô . 
si,m a § ox, ijn R j J no 


Applying Newton-Raphson linearisation, on the rth step we 
obtain: 


pane aay "I 655i, n!P - (Ra) (Pr) D, paT oF ie 
sa Sajna ee 
Bijk,m a oy F Cy -Aij ary") i a 
where Ji nm = Bisa Ta + Bi Oe + (PEDALS op) Sam 
+ (PALS on: 


The (r+1) -th iterates for pressure, temperature and velocities 
are then obtained by: 


B (x41) _ 3!) S apo) A raD Pea] Ny ap (®) r 
s s s i i 
HREL ESE) ~(x) 
ah = UR + oats . 


The iterative sequence is started by initial approximations 


which can either be a creeping flow approxmation or a solution 
from a lower Rayleigh number. 


The governing matrix of the system can be considered to be 
composed of an M x M set of sub-matrices-of the following form 
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J* = c o o (2.18) 


Sij Kij 


where the zero block on the diagonal corresponds to the 
pressure variables. When using a Gaussian elimination strategy, 
pivoting is avoided by eliminating all velocity variables prior 
to pressure elimination. Fill-in-thus ensures a non-zero 
pressure pivot. 


2.2 The buoyancy driven cavity problem is summarised in Fig. l 
together with a typical finite element grid used. The square 
cavity has differentially heated side walls (hot and cold) and 
both horizontal walls are adiabatic. 


The buoyancy driven problem is a square region at a fixed 
Prandtl number of .71 was proposed as a test problem by Jones 
et al. (1979) for validating solution procedures for the 
Navier-Stokes equations. The principal features of the flow 
are well documented (see Winters (1980), Marakatos and 

3 
Pericleous (1984), Marshall et al. (1978)). For Ra = 10 , the 

4 
flow is virtually symmetric about both axes. At Ra = 10, we 
lose some of the symmetry, the flow concentrating towards the 
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5 i 4 > 4 
vertical walls, but by Ra = 10, the distortion is considerable 
with a weak two eddy structure in the core of the fluid. At 


Ra = 10°, the flow is concentrated in the boundary layers with 
a weak re-circulating flow in the centre. At the highest 
Rayleigh numbers, waves are generated in the lower region of 
the hot up-going boundary layer and the upper region of the 
cold down-going boundary layer breaking into a turbulent core 
and the cellular structure of the flow is lost. At these 
Rayleigh numbers the steady state equations cease to be valid, 
and we should switch over to turbulence calculations. 


2.3 When solving at a fixed Rayleigh number, we are at each 
stage solving a linear system of the form: 


J* ôx = - f where J* is given by (2.18) and (6x) "= (Aus Apt ar") 


which is equivalent to 
(3*)" (g*) 6x = = (a+) E ; (2.19) 


so the Newton Raphson technique is equivalent (for n equations 
in n unknowns) to Gauss-Newton winimization of the two-norm of 
the right hand side vector £ of the equations. It is evident 
when starting from a creeping flow approximation that Newton- 
Raphson and other optimisation techniques break down (due to 
the existence of false minima). Further, as the Rayleigh 


number is increased above 10°, the numerical solution of the 
problem becomes increasingly difficult. The continuation 
Procedure we have developed to solve at high Rayleigh numbers is 
Summarised in Figure 2. It is based on the following simple 
Observation of general Newton behaviour: 


(a) When the starting iterate is far from the true solution, 
there are commonly two or three cycles in which the shifts are 
of similar size. The error at the end of a cycle is hardly 
smaller than that at its start. 


(b) There is then a cycle in which the shifts are smaller than 
those that went before, by a factor between, say, 0.5 and O.l. 


(c) Finally quadratic convergence sets in, with shifts which 
are more than one order of magnitude smaller than preceding 
shifts. 
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SET: TOLERANCE VALUE TOL 1 
STARTING RAYLEIGH NO 
TARGET RAYLEIGH NO 

INITIAL ZKK VALUE 


CURRENT RAYLEIGH NUMBER 
RAY = (1.+ZKK)*ROY 1 


SOLVE SYSTEM FOR CURRENT 
RAYLEIGH NUMBER 
PERFORMING ONLY 2-N-RS 


YES 
IS RAY > ROY 2? 


IS RATIO < TOL 1? 


REDUCE FACTOR ZKK 
ZKK = .5*ZKK 
DO NOT ADVANCE ROY 1 


ZKK*TOL 1 , 2*ZKK 


RATIO*2 
ADVANCE RAYLEIGH NUMBER: 
ROY 1= RAY 


[SET = nov af SOLVE TO HIGH ACCURACY RAY = ROY 2 


Fig. 2 Flow Diagram of Continuation Strategy 


SET ZKK = MIN 


At stage (a) we are close to the size of error at which 
the process may not converge. We must avoid this. At stage 
(c) we are changing the parameters so little that we are not 
making progress efficiently. We wish to operate, so far as 
possible, at or near stage (b). We can use the ratio of 
second Newton shift to first Newton shift as a good indicator 
of whether we are in the region of "safe convergence". In 
essence, this means that we are tracking the solution ina 
tubular neighbourhood of the true solution curve. 


we do not find that we have to reject many 


In practice, 
thm tends to shorten its step length 


steps, since the algori 
well before any problem is encountered. 
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On a (5 x 5) uniform finite element grid the algorithm slows 


down for Ra > 10°. We have tracked the determinant of the 
Jacobian along the solution curve and have observed that a 


singularity occurs at ~ 1.26 x lo’. Further at any Rayleigh 


number > 10°, spurious (non-physical) features are observed in 
the velocity field solutions (see Figure 3), strong 
re-circulations being observed near the vertical walls. 


+5 


Ers 


RAYLEIGH NO= 1000000 
SCALING FACTOR= 0.0010 


Fig. 3 
Mesh refinement alleviates the problem to a certain extent. 
On a (9 x 9) uniform mesh, the strong recirculations near the 
vertical walls are not observed at Ra = 10° (see Figure 4), 


these now occurring at the higher Rayleigh number of 10” (see 
Figure 5). 


+ 
nannan nnna 


ats tS 
uuu nnuuuna 


RAYLEIGH NO= 1000000 
SCALING FACTOR= 0. 0005 


Fig. 4 
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RAYLEIGH NO= 10000000 
SCALING FACTOR= 0. 0003 


Fig. 5 


A series of tests on successively finer meshes shifts the 
singularity from a Rayleigh number of ~ 1.2x107 (ona5x5 


grid) to a Rayleigh number of ~ EBSO (on a 9 x 9 grid). A 
non-uniform 5 x 5 grid (with narròw elements near the walls) 


shifts the singularity to ~ 10°, and a non-uniform 9 x 9 grid 
shows no signs of the singularity up to ~ 8 x 10°. Figure 6 is 


a solution obtained at a Rayleigh number of 108 on a non- 
uniform 9 x 9 grid. We need much more mesh refinement 
(especially on the corners) to pick out the general flow 
features, and in any case we are now well into the turbulent 
regime and should switch to solving the turbulent equations. 


RAYLEIGH NO= 10000000 0 
SCALING FACTOR= 0, 0001 


Fig. 6 
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In practice, the continuation procedure has proved an 
efficient automatic procedure for obtaining solutions at high 
Rayleigh numbers. Our objective has not been to obtain highly 
accurate solutions (which would require much finer grids and 
very much finer refinement in the boundary layers than we have 
used), but rather to obtain solutions on relatively crude 
meshes which reflect the general features of the flow, and that 
can be used as initial guesses for finer grid solutions at the 
same Rayleigh number (thus providing a substantial saving in 
computing time and efficiency). This method of continuation 
in Rayleigh number has also been extended to continuation in 
Prandtl number and is generally applicable to continuation in 
any parameter. 
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A TECHNIQUE TO SOLVE INCOMPRESSIBLE FLOW EQUATIONS 
COUPLED WITH A STRESS EQUATION 


M.F. Webster 
(ICFD, Department of Mathematics, University of Reading) 


1. SCOPE OF PAPER 


A numerical scheme is presented which solves equations of 
Navier-Stokes type for steady planar two-dimensional 
incompressible flows. The flow variables considered are stream 
function y, vorticity w and stress >. The equation system to 
be solved is then stated in non-dimensional form as 


vy = -w, (l.la) 
u= Ry- Yw + Wu. Yt, (1.1b) 


with the definition of velocity u = Y x Ņ = i i m and 


where R is a Reynolds number and W a stress parameter (W is a 
Weissenberg number for the Non-Newtonian fluid model). Equation 
system (1.1) is completed by a stress description equation. 
Attention has initially been directed towards a stress term 
that arises in a simple Non-Newtonian fluid model where 


Ey, (1.2) 


This description is derived from the so-called second-order 
model. It is an overall aim that the techniques proposed 
should be sufficiently general to find application in any flow 
situation that may be described by a coupled system of the 
incompressible flow equations and a stress equation system, 
eg. some models for turbulent flows and buoyancy driven flows. 


The full system of equations (1.1) and (1.2) is considered in 
a form with (1.1b) replaced by 


Wy. Vo - ¢ = -Ru. Yù, Wn) 
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representing two elliptic Poisson equations and a hyperbolic 
equation. 


It is discretised by finite difference methods on a uniform 
grid: the usual five-point operator is used for (l.la) and 
(1.2) with (1.3) replaced by a Crank-Nicolson type scheme. The 
resulting system of (nonlinear) equations is solved iteratively 
in a sequence of Picard-type iterations at the outer or 
nonlinear level. At the linear or inner level a combination of 
inner iterations for the elliptic equations (SOR) and direct 
marching scheme for the hyperbolic equation is used. The 
convergence of the iteration and the stability of the 
difference schemes are analysed for a forward-facing step 
problem. Results are given for various model problems covering 
a range of values for the two-parameter family (R,W). 


2. HISTORICAL BACKGROUND 


The full system of nonlinear equations is solved iteratively. 
This paper concentrates on a traditional approach of 
linearisation by decoupling, introducing a set of linear 
equations and an outer Picard-type iteration. The present 
iterative scheme is compared to two similar iterative schemes 
proposed by Crochet and Pilate (1976) and Davies (1983). With 
integer n 2 0 indicating an outer iteration number these 
schemes may be summarised as follows: 


Wu Y(U WTL) + Ru".yu"2 - v2u"3 = 0, (2.1a) 
+ 
vye 1 Ni antl aioe: anes w y ad a (2.1b) 
where for Scheme l (Crochet) Sees, Xo) a tan n+l; Scheme 2 


(Davies) gjuan, ry n+l, R = O; Scheme 3 (Present) rp 


n+l, r, =n, 

Scheme 1 yields a third-order differential equation for w 
leading to convergence difficulties in the corresponding inner 
iteration. Converged solutions of the outer iteration were 
reported for the (R,W) values of {(1,.1), (10,.2), (100,.4), 
(500, .8), (1000,1.4)}. A critical upper limit on W was 
observed for each selected R value, though this limit increased 
with increase in R, Scheme 2, with R = O, gives an efficient 
inner w iteration but W is effectively limited by the numerical 


smoothing of the source term wa. (V un) + Converged solutions 
were reported for (R,W) of {(0,.1), without filtering; (0,W), 

W < 10 with filtering}. The third and present scheme differs 

from schemes 1 and 2 by the inclusion of a three-step outer 
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iteration, and the solution for vw at the linear level by a 
direct marching scheme. No upper limit on W is found for 
converged solutions for R < 10, though scheme 3 is dependent 
upon restriction of R. 


The dependence of the outer iteration on R and W may be 
investigated through a linearised perturbation analysis using 
a single Fourier mode (see Morton et al., 1985). It may be 
deduced that low frequencies, (i.e. long wave-lengths) i, 
give convergence difficulties for all three schemes. Scheme 2 
also suffers at high frequencies leading to a bound on 


Ikl = wi (see Tanner 1982) and convergence difficulties with 
increasing W. For scheme l at fixed R, as W increases from 
zero a limiting value may be predicted W« R else which will 


increase with R. The present scheme presents much less severe 
restrictions on W for small R, but the situation is expected 
to deteriorate more rapidly with increasing R. 


Acceptability of converged solutions for small R is based 
upon the Tanner/Geisekus theorem, (see Tanner 1966), for 
creeping flow (R= 0). At W= O the solution for ¢ is trivial, 


t= vie = O, and from Tanner's theorem the velocity field 
fulyio also satisfies the problem VW > O. 


3. FULL STATEMENT OF MODEL PROBLEM 


The solution of the forward-facing step or contraction 
geometry is considered for the model equations discussed 
earlier and used in the following form: 


wegen pH. -Ry Jw", (3,1a) 
gent = ott L (3.1b) 
gantl on ut patla g yt a avat (3,16) 


The region of solution is shown in Fig. l: ABCD is a fixed 
boundary, FE a symmetry boundary, AF the inlet boundary and DE 
the outlet. The flow is assumed to be fully-developed at inlet 
and outlet. With, firstly, a statement of the minimum boundary 
assumptions required, followed by the implied conditions 
actually used, the appropriate boundary conditions are 
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Fixed boundary ABCD 


u=v=0 =y-0,u=-2%,¢=0; (3.2a) 
on 
Inflow boundary AF 
u given, v = 2 =O =y}, w given ; (3. 2b) 
Outflow Boundary DE 
ao poat Iida =O; 
v=0, E o ea I ofr ae ae) OF (3.2c) 


Symmetry boundary EF 


5 =O, v=O = W = constant, w = 0, t = 0. (3.2d) 


For given values of R and W, the last remaining parameter in 
the problem is the step ratio AF:DE taken as 4:1. 


Commencing from W = O for (3.1) and introducing small w>0O 
involves consideration of a singular perturbation problem. 
One may expect boundary layers for ¢ to result dependent upon 
the boundary conditions imposed. If a zero value of ¢ is 
imposed at outlet then boundary layers are avoided. This is 
consistent with the exponentially decreasing complementary 
function form for ț¢ when treated as ODE along the streamlines 
in the upstream direction. 


Fig. 1 Schematic flow diagram 
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4. NUMERICAL APPROXIMATION 
4.1 Finite Difference Equations 


The region 2 of Fig. 1 is covered by a regular square grid 
of side h and nodal values of the variables y, w, Ç are denoted 
by 455 etc. at nodes x = ih, y = jh. Values of h of 2 and ie 
are used, with some 1400 mesh points and 16 mesh lengths at 
inflow for the former choice. Use is made of standard 


difference operator notation as follows: 


Sieg Mae g 7 Mag 7 MaMa 7 aay tay? ee) 


with similar meaning for oy and ithe The difference 


approximations for the Poisson equations for w and y (3.1b,c) 
are then 


2 2, carl 2 n+l 
6 +6 2 =h x 
g Y ; aan Sui 
the standard five-point schemes: here superscript (n+l) 
denotes the stage of the outer iteration process, 


ntl y wth yore 


EARS Si n2untt X (4.2) 


2 
7 (6 ô ete Ati 
( A y*ij i,j 


Two different schemes are used for the stress equation 
(3.la): the box scheme in the recirculating flow region a and 


the Crank-Nicolson type scheme in a where the flow is 


predominantly in the x-direction. For the former, velocities 
are required at the centre of each cell and are given by the 
four-point formulae 


n ps) 5 n NY s 
Wage Pa Oy Vits je ' Y ita jts TT By Ox Vita, jth ° 
h h (4.3a) 


The box scheme approximation to (3,la) is 


naie ERO E - hlouu g + (1-0)5 TA 
eee TREN i4, j+ R y ite, ji i,j+l 

(4.3b) 

VANE n n n] 

F afo US tY Spe Jits j+s 


where the parameter @ permits weighting within the scheme on a 
pointwise basis and ensures stability for all possible velocity 
fields. The Crank-Nicolson scheme covers two neighbouring cells 
with the same x-coordinates and is therefore centred at (i+4,j): 
thus the velocities are given by the unsymmetric formulae 
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n n n n 
= .=-6 R 37, (4.4 
Yi j ies Lag ee Vits j A Yis, j x Yirs,j ( w 
h h 
and the approximation to (3.la) is 
n n n+l ent] + 
[wo ôa +v HHS) E - ht ae = 
n n n 
2 6 i 4.4b 
alu ôg +v uh WG RE ( ) 
re = 4 : 4.4 
where Cua, Bos + (1-8) čij (4.4c) 


The weighting parameter B is introduced to again ensure 
stability of the scheme for all velocity fields. 


4.2 Boundary Condition Implementation 


For the stream function 4° in (4.2), Dirichlet boundary 
conditions apply over the whole boundary except DE, from 
(3.2a,b,d). From (3.Zc) Neumann conditions apply on DE (same 


for w") and are incorporated in the following fashion: 


n En 
de te Hale E arene 


Fig. 2 Mesh near boundary 


3 A 
Likewise for w in (4.2) Dirichlet conditions apply on AF and 


EF from (3.2b,d); only the specification of a” on the wall ABCD 
remains. This approximation is achieved by Taylor series 
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expansions based on local wall velocity conditions. With 
reference to Fig. 2, use is made of the following derived 
formulae: 


n -2 n-1 
E W: a 
Hin Le an SE ee (4.6a 
u =- ah y e eaa (4. 6b) 
i,J i,J+1 EERS ? i,d+1’ j 
n tle -2 fy n-1 
pat E tey PLN T a o 6S) 


where I and J are wall coordinates, (I,J) is the re-entrant 
corner point C, and r in (4.6b) may be chosen at level n-l or 
n. Formula (4.6a) is formally first-order accurate, see (Thom 
1933); formula (4.6b) is second-order accurate, see (Woods 
1954), and is used with r = n at all wall points bar C; 
formula (4.6c) is attributed to Kawaguti (1965) and provides a 


finite estimate of w based on the first-order form. At C, w 


is unbounded; however the simple formula (4.6c) generates an 
estimate closely approximating that provided by an asymptotic 
expansion matching scheme (see Holstein and Paddon 1982). 


1 


For stability reasons the equations for rai in (4.4b) are 


integrated from outflow to inflow, and from (3.2a,c,d) garl is 
set to zero on all boundaries, except on the inflow boundary 
AF. Fully-developed conditions at inflow provide a 
compatability check on the solution generated and for 


consistency gn should decay and vanish there. Providing the 
exit length CD is sufficiently large no boundary layers are 
encountered, and in fact with R = O, then ¢ = 0 VW20 as it 
should be. If the Crank-Nicolson scheme (4.4) were to be used 
over the whole region a sufficient data would now be available 


with lwp j specified at all interior and boundary points: 
' 
the same is also true if the box scheme (4.3) is incorporated 


in a limited zone a based on ABC but not extending across to 


the symmetry boundary EF. 
5. SOLUTION PROCEDURE 


The nonlinear equation system is decoupled into a system of 
three linear equations. A three-step outer Picard-type 
iteration is introduced where each step constitutes the solution 
of a linear equation at the inner level. The process commences 


with the iterate (yw WV) and the order of computation is to 
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first calculate Bie. from which oe is generated, and finally 


aie is obtained from pits The cycle may then be repeated 


n+l 
after recomputation of the new velocity field iterate u v 


+ +1 
The difference equations (4.2) for w F and y> are solved 


using a secondary inner SOR iteration with empirically 
estimated optimal relaxation factors of 1.6 < Py < 1.8 and 


Pi = 1.0 (as in Davies et al., 1979). 


The stress oe is calculated in the following manner. 
First, the box scheme (4.3) is introduced for the recirculation 


zone a, Proceeding in a point-by-point fashion starting near 


h 
n+l 
B, REAT is given by 
n+l n+l n+l n+l MON 
CARN Sabai S E gaat SART aa (Se) 
= (30-4)2 — wiyey)” y aga n 
where c, = (36 4)5 W(u Vij Ca at W(u + Vat, jah! 
m= 2, 3, 4; andO < 0&1. It is vital that ar is restricted 
to the recirculation zone defined by the known separation line 
n+l 


value of y. Otherwise, large numerical oscillations in ¢ 
occur cross-stream and are subsequently swept upstream. This is 


due to the sign switch of coefficient cr when v begins to 


dominate u°. The factor @ may be altered from the average 
value of unity to the totally weighted zero value dependent 
upon a pointwise selection criterion; its choice guarantees the 


tabil E th iati -v) 
s ility o; e scheme for all variations in (u Via 
Second, the Crank-Nicolson scheme (4.4) is used in a involving 


a direct line-by-line marching procedure from outflow to 
inflow. A tridiagonal matrix of equations results for each 
n+l 


$ n+l 
re 
line and or is related to rey by 
v ge ARa - S 
[ C41 ( (1-8) ) by real 


ntl 
[y tja + (O-BR)C, - veja] E E (5.2) 
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Woon n 
where V = 7 V. F . 
4 VI+, j MTs, j 


is used to solve the system of equations (5.2). Numerical 
stability of this solver is guaranteed since the associated 


and U = W A fast tridiagonal solver 


matrix is diagonally dominant almost everywhere in a, i.e. 
i n n 
rovided > = A 
provide: EA k eni J B=0. Froma Rone 1-D 
analysis of simple channel flow, the condition 8 £ h must be 


satisfied to avoid streamwise oscillations in the implementation 
of the Crank-Nicolson scheme. Furthermore, from a discrete 
Fourier analysis of the line-by-line marching sweep, the 


stability criterion ß < z + 4 emerges. Both conditions are 
satisfied and stability affirmed if B = O and ans j 2 0, 

, 
(see Morton et al., 1985). The Crank-Nicolson scheme is only 
applicable for a zone predominantly in the x-direction, where 
streamlines can effectively be tracked, If it is extended to 


1 


x 
a a large cross-stream oscillation in “ag is observed due to 


small velocity field values, and leads to a failure to converge 
in the Picard iteration. 


2 2 


Relative tolerances in[t, w, y] of [10 ^, 10^, 1073} are 
used to monitor iterative convergence: smoothing of one outer 
iterate with the next is also used. It is generally observed 
that as R is increased, convergence criteria become 
increasingly more difficult to satisfy. tne convergence of the 
Picard iteration is directly related to R: for small R 
convergence may be obtained for any W, whilst for larger R 
(greater than 10) convergence of the Picard iteration 
deteriorates for larger W. These conclusions are in agreement 


with the arguments of section 2. 


A selection of results is presented in Figs. 3 and 4. For 
-4 
creeping flow (R = O) ¢ vanishes Vw2o. For R=10 


= 2 F 
solutions are found for 10 = < W< l0“: ¢ behaves like Ru. Vw 


for W < = and like Bo for W > = , where U is a characteristic 
velocity value. The same relative behaviour for > is observed 
for R g l. The solutions for t for O.l < Ws loatR=1 and 
R = 10 are shown for comparison in Fig. 3. For completeness 

the solutions for y and w are also given at W = 1 for R = 1 and 
R = 10 in Fig. 4. Solutions for p and w change negligibly with 


W for R<l. 
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Fig. 3 a-c Stress contours 


INCOMPRESSIBLE FLOW PLUS STRESS EQUATION 


(d) 


2 Ei 
R=10.0, W=0.1 


(e) 


2 4 
R=10.0, W=1.0 


R=10.0, W=10.0 


Fig. 3 d-f Stress contours 
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Fig. 4 c,d Stream function and vorticity contours 
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6. EXTENSION FOR A TURBULENCE MODEL 


An initial proposal to utilise the techniques already 
outlined is to consider the k ~ 2 one equation turbulence 
energy model as the stress description equation replacing ¢ 
(see Thomas et al., 1981). The model variables are then 
turbulence energy k, w and y and the model equations are 


x ðu, du ðu, 


(ueY + Yk”) k = YB.Yk#BVÝk + k's, s=s+ +g (6.12) 


əx dk” 
J L 


i] 


1 


2 
u. Vw - DWw v. (Da Yyy) + aD We 2D Wyt N a A a 


Vpe-w, uaVxy, (6.1c) 


where D = rt + BK? 7B = ae + Bo`lk” ve oes 
ARE > O; and B(x) is the length scale given by an 


assumed algebraic specification. S is noted to be a positive 
definite field variable. 


The treatment of (6.la) for k is similar to that for Ç. The 
positive coefficient y indicates a switch to marching in the 
flow direction and there is now an additional term V. (BVk) to 


augment the scheme. With reference to Fig. 5a, Sa at point 
Py is calculated by the Crank-Nicolson scheme as follows; 


n+l n+l n+l n+l _% 
ak uses (ko kg) ; af uses (ky + ky, kyr ko? f k'k 


is replaced by Ca raia , and YB,B and bk 4s also use 


(k^); ave uses Kt at Pye Por Par Par Por Por Po and 

Ke at Po. A tridiagonal system again emerges with unknowns 
(ky, koe ra at each step. Next with reference to Fig. 5b, 
the box scheme is used to calculate ss within a recirculating 
flow region. Thus mn is obtained using (kor k3, a and, 


additionally, (k, k)” for BYk. 
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xo 


(b) 
Fig. 5 Finite difference scheme stencil 


Boundary conditions follow as before except for k and w at 
the wall. Here the velocity field near the wall is used to 
derive such boundary conditions based on the universal 
logarithmic law of the wall and matching with a viscous 
sublayer, (see Turner, 1973). This necessitates the 


calculation of the wall shear stress THALL from the known 


velocity field u using the relationships 
aN ee ks 
“are Rann 7) OW Cy | ee 


Also at attachment and separation points the normal derivative 
of k at the wall must vanish. 


(6.2) 


The approach adopted shall be to solve laminar flow for 
R < 2000, using for the vorticity equation the centred 
difference scheme of Dennis and Smith (1980) with exponential 
coefficients. Beyond R = 2000 the switch to modelling 
turbulence with the proposed scheme can then be attempted. 
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l. INTRODUCTION 
1.1 Review 


Starting from the Helmholtz decomposition of a given vector 
field u into 


u=VotV xy 


where b is a scalar potential and p a stréam vector, we propose 
to study the efficiency of such a decomposition for solving the 
steady compressible EULER equations; in that approach y is 
introduced as a correction to the transonic potential flows. 


Let u be any solenoidal vector field (V.u = 0), it is well 
known "see (De Rham, 1961)" that u is the curl of a stream 


vector jp: 


u=Vxyp. Ged) 

In the 2-D case with suitable boundary conditions, is 
uniquely defined from u and it has only one non zero component; 
this property has been introduced, among other authors, in 
(Amara, 1983) to calculate a correction for isentropic 
potential flows. 


The ideas of Amara (1983), Ecer & Akay (1982), Grossman (1983), 
Lacor & Hirsch (1982), Sokhey (1980) and Po aa E N 
have been extended in (2-D) and (3-D) for general in 
flows by El Dabaghi & Pironneau (1985) and good AEn oy 
been obtained in subsonic rotational case by El Dabaghi (1984). 
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1.2 Notation 


V : Gradient A: Laplacian V-: Divergence 
Vx : Curl (u@v).,=u, v, ux v = Cross vector product 

ij peal between u and v 
Q : Bounded domain T = Tin oute : Boundary of 2, 

n its outward unit normal 

u : Velocity ww : Stream vector $ : scalar potential 
p : Density p : Pressure E : Total energy 
H : Enthalpy s : Specific. entropy S : Modified entropy 
w +: Vorticity y : Cp/Cy : Specific Heat ratio (1.4 in air) 


2. THEORETICAL RESULTS 
2.1 Notations 


Q is a bounded open set of R™(m = 2,3), simply connected 
with boundary I € cr ET aati 


on T we define n its outward unit normal. 


connected component of T); 


(a,b) denotes the inner product in L° (9N), and | he 


associated norm. 


dloa t 


ai 
H (9) /R = ula) quotiented by the constants. 


1 1 
H (9) = {vE H (Q): vip = o}. 


2.2 Decomposition theorem 


We will state the first result on Helmholtz decomposition 
which is a straightforward generalization of Bernadi (1979). 


Theorem: Let u be a given vector field of a 0). Let ġġ be 
the unique solution in Hl (Q) /R of 
ih 1 
(Py) (Vo, Vw) = (u, Ww) Vweur (Q) 7, A 


Let y be the unique solution in #3 (a) of 


(a) (WW, Ww) = (ur Vw) Vw atiay, 


Then 
u = Vo + Vay . (2.1) 


Proof "See (El Dabaghi, 1984)". 
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3. NUMERICAL INDUSTRIAL ASPECT 


3.1 Motivation 


= 


Shock 
Surface 


ue 


Fig. 3.1 


(For simplicity, only a portion of wing between two walls is 
considered on Fig. 1). 


Around non lifting bodies, for example, flows described by 
V.(pu) = 0 (3.1) 


are well approximated in transonic regimes by the isentropic 
gas law assumption: 


1/Y-1, 


2 
vega - ({ul%72) Vo] =O, (3.2) 
where H is given and we assume 
u = Vd and w = Vxu = 0 (3.3) 


Equation (3.3) is valid only if we suppose that the 
entropy S is constant or if the jump is close. to zero across 
the shock. 


However when strong shocks are developed in the flow, S is 
no longer constant and the jump of S is no longer small and 
therefore the assumption u = Vd fails; also in order to 
generalize the potential approach of Glowinski et al (1984) at 
high transonic regimes, we introduce a Helmholtz decomposition 


of u: 


u = Vo + Vxp (3.4) 


in which the second term of the decomposition can be 
interpreted as a correction. 


546 EL DABAGHI ET AL. 


3.2 Steady Euler Equations for Inviscid Flows 


In order to solve the Euler equations via a (¢ , ) - 
decomposition of the velocity, we recall at first the well 
known fundamental equations: 


V. (gu) =O (Mass conservation) (3.5) 
V. (pu @ u) + Vp =O (Momentum conservation) (3.6) 
V.(pEu + pu) =0 (Energy conservation) (3.7) 


To these equations we add the following thermodynamical 
relations: 


p= p s (State gas law) (3.8) 


V. (osu) =O (Second thermodynamic principle) (3.9) 


where 
AREP. og (3.10) 
E =H- p/p (3.11) 
= 5 lu^ (r/y-2) (w/e) - (3.12) 


Now in order to derive a convergent (¢-)- algorithm, the 
main idea is described as follows: assuming the velocity known, 
from the evaluation of w, we calculate y and then we add 
V x Ņ to Vo by a fixed point process to obtain a new value of 
u; more precisely (3.4) yields: 


VxVxv = w ; (3.13) 


assuming ) solenoidal and adding suitable conditions on Y, 
then y can be computed by (3.13) if w is known. 


Equation for w 


Developing (3.6) and combining (3.5), (3.8) and (3.12), we 
obtain the following relation 


p VE ~ pu xw - clp’/y-1)vs = 0; (3.14) 
then resolving the cross product of u with (3.14) and using 


the unknown scalar function À (very close to helicity) 
defined by: 


As a.u/p|u]? (3.15) 
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we obtain finally the following equation for w: 


we Ap us uju x y- YS - YH) (3.16) 


where à, H, p and S are to be determined. 
Equations for A, H, p and S 
It must be remembered by the definition of w that 
Vw=O.- (3.17) 


Therefore combining equations (3.17) and (3.5) with (3.16) 
leads to the transport equation for À: 


pu.VA = -vL lala x (p "/v-1)vs-va], (3.18) 
On the other hand replacing (3.11) in (3.7) and using (3.5) 
we obtain also a transport equation for H: 


u.VH = O (3.19) 


and taking the inner product of (3.14) with u, we find also a 
transport equation similar to (3.19) for S: 


u.VS =O (3.20) 


where H and S can be computed respectively from their value 
at upstream infinity: 


H (2 (x)) (3.21) 
So (2 (x) ) (3.22) 


H (x) 
S (x) 


where Ho and S, are respectively the given enthalpy and 


entropy on TI, and Z(x) is the upstream intersection of ri 
in n 


with the stream line that passes at x. 


Finally an equation for P can be derived from (3.12): 
1 


L 12,7 yn 
p= yea @- 5 lel”) : (3.23) 
Ys 
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At this point as u,H,A,S and ĝ are known w can be evaluated 
from (3.16), then the rotational correction can be computed by 
solving (3.13) and finallyu can be updated by (3.4) by solving: 


V(pV$) = - Vp.Vxp (3.24) 
Remark 3.1 


From the above considerations, it appears that the master 
piece of the associated algorithm, in order to solve the 
Euler equations via a (¢,~)-formulation, is the modified 
transonic equation (3.24). 


Boundary Conditions 


In order to solve (3,24), we prescribe on T Neumann 
boundary conditions corresponding physically to slip 
conditions: 


3$ 


E ae 


Ir (3.25) 


which imply a non standard boundary condition on y added to 
(3.13): 


(Vx) “Alp = Ọ; (3.26) 


in fact the boundary condition (3.26) will be replaced by a 
natural one: 


Yir =0 in the 2-D case (3.27a) 

vn p = O in the 3-D case. (3.27b) 

Considering the above boundary conditions and the nature of 
the flow (shocks can occur in transonic regimes), the 


following remarks can be made. 


Remark 3.2 


If we deal with a homogeneous boundary conditions on ġ of 
Neumann type: 


E] 
z Ir =0 (3.28) 


(3.28) involves a non homogeneous boundary condition on Ņ: 


(Vxy) Ap = Ueny, (3.29) 
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which implies on IĮ the solution of a Laplace-Beltrami problem 
in order to find Yip "See (El Dabaghi & Pironneau, 1985)". 


Remark 3.3 


Equations derived for w,A,S and p are not valid across 
shocks: we have to add locally, where discontinuities occur, 
the Rankine-Hugoniot conditions detailed in (El Dabaghi, 1984) 
in order to compute the jump of the above variables across 
shocks. 


For the following section we denote by [v] the jump of a 
variable v across shocks. z 


3.3 Fixed point algorithm 


In order to compute the rotational correction of a 
potential flow (described in section 3.2) we consider the 
following algorithm: 


Step O Initialization 


yo 0, S°=S r H°=H,, A°=\_(#: at upstream infinity) 


Compute a first guess for w=V9° and °° by solving the non 
linear transonic potential equation (3.2). Then for m0, 


+ + +l mtl m+l ,mtl 
eee AEE 1¥ a) 


assuming u” known, compute S ' 


oe and ae by the following: 

Step 1 Solve SORGT = 0O in fN, except at shocks. 

Step 2 Solve image =Oin dQ 

Step 3 Solve 

wee oa Ve fas] ut?) x ttot V4 /y-1) vst - vaT™) j 


in 2, except at shocks. 


Step 4 Compute ere by (3.16), 


+l m+l . 
Step 5 Solve vxvxy™ =W in Q 


with the boundary conditions (3.27) . 


+1 m m+l , 
Step 6 SolveV-(p Ve) =- Vp -Vxy in & 


with the boundary conditions B25 


550 EL DABAGHI ET AL. 


es 
Step 7 Update ai Ti by (3.4) and a” z by (3.23) m = mtl, go to 


Step 1, until convergence of the entropy correction. 
Remark 3.4 


i) The above algorithm (Step 0-7) operates for 2-D and 3-D 
general situations with finite elements. 


ii) In Step O and 6 solutions 4° and ere are computed by 
least squares formulations using preconditioned conjugate 
gradient algorithm; physical shocks are obtained by upwinding 
of the density in the flow direction "See (Glowinski et al. 
1984)". 


iii) Transport equations of Steps l, 2, 3 are resolved by 
characteristic methods using finite elements detailed in 
(Pironneau, 1982). 


iv) Concerning the boundary conditions involved in step 5, 


the 2-D case is solved by a classical p? finite element method 
but in the 3-D case, the situation is more complicated and a 
general recent study concerning the determination of y with 
numerical results can be found in (El Dabaghi & Pironneau, 1985). 


v) In the 2-D case, 4 = 0. 


vi) For the numerical experiments we consider an isenthalpic 
fluid (H = Constant). 


4. NUMERICAL EXPERIMENTS 


In this section we present the results of a first numerical 
test using the method discussed in the above section. It 
appears from the experiments that the method is well suited to 
obtain in a efficient way, a transonic Euler solution as an 
entropy correction of a potential flow. 


4.1 Transonic simulation of flow around a NACA 0012 airfoil 


We have considered a transonic flow around a NACA 0012 


airfoil at angle of attack a = 0° and Mach number at infinity 
M, = 0.85. 
o 


Fig. 4.1 shows a typical triangulation with enlargement near 
the body, made of 2354 nodes and 4568 elements. As an initial 
guess, a transonic potential flow with shock is computed, then 
the localization of the shock allows one to start the entropy 
correction process and the Euler solution is reached in a few 
iterations (Iter < 5). 
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Fig. 4.2 shows the Mach lines around the airfoil of the flow. 
We have presented on Fig. 4.3 the entropy lines created across 
the shock via the Rankine-Hugoniot condition and then 
transported downstream by the characteristics method. The 
vorticity in the flow created behind the shock is also plotted 
on Fig. 4.4 and we can observe a good symmetry of the correction 
stream vector on Fig. 4.5. 
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Fig. 4.5 
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The shock location and the entropy level have been 
compared with an Euler solution entropy value on the skin of 
the airfoil, it appears to be in good agreement on Fig. 4.6 
with the results obtained by Lerat (1981). 


© Finite Elements 
4 Lerat 


Fig. 4.6 


Finally a convergence history of the solution is shown in 
Fig. 4.7 and we can observe that only 3 to 5 iterations are 
necessary to reach convergence which gives a CPU time ratio 
(3 to 5 units of time to get the potential solution) compared 
to the CPU time of a full Euler solution. This fact makes the 
correction method attractive for further 3-D applications. 


—> 
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4.2 Further comments and conclusion 


We have presented in this paper an entropy correction 
method based on the velocity decomposition for the numerical 
solution of the Euler equations modelling transonic flows. 
With this method we can compute finite element Euler solutions 
via an efficient iterative process including a nonlinear least 
squares formulation for the generalized full potential 
equations and also a method of characteristics for the transport 
equations in order to compute the stream vector correction. 


The modularity of the associated algorithm allows the 
introduction of local entropy corrections in selected 
subdomains of the flow. Further more complicated 3-D 
applications of industrial interest (inlets and complete 
aircraft) combining the above ($,W)-formulation and domain 
decomposition methods "See (Dinh et al., 1985)" with finite 
elements for lifting and/or strongly rotational transonic 
flows are considered and performance evaluations will appear 
in a forthcoming paper. 
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SEMI-INVERSE MODE BOUNDARY LAYER COUPLING 


S.P. Newman and P. Stow 
(Rolls-Royce Limited, Derby) 


1. INTRODUCTION 


The effect on the surface boundary layer on an inviscid 
blade-to-blade flow is in general important in terms of 
blockage and flow deviation and consequently the boundary layer 
and inviscid calculations must be coupled together iteratively. 
There are two main procedures for modelling the effect of the 
boundary layer on the inviscid flow, a displacement model or a 
transpiration model. In the former the blade geometry is 
altered by the boundary layer displacement thickness and the 
inviscid flow is calculated around the modified geometry. With 
the transpiration model transpiration of mass, momentum and 
energy through the blade surface is used to model the effects 
of the boundary layer. Matching conditions at the edge of the 
boundary layer determine the transpiration parameters to be 
used. The choice of model is often determined by properties 
of the inviscid method being used; for example the 
displacement model being adopted with methods where the use of 
a boundary streamline is essential eg. a stream-function 
approach or the streamline curvature method. With a finite 
volume or finite element inviscid method the transpiration 
model is easily incorporated. 

With either a finite difference or an integral boundary 
later method the iterative coupling with the inviscid mainstream 
calculation needs to be given special consideration irrespective 


of whether a displacement or transpiration model is adopted. 

In cases where the boundary layer remains attached and a second 
order effect then direct mode coupling can be adopted; in 
other cases inverse or semi-inverse coupling needs to be used. 
These are discussed below in the context of a full potential 


blade-to-blade method that adopts a finite element solution 


procedure. 
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2. BOUNDARY LAYER INCLUSION 


The inviscid calculation adopted in this paper is a finite 
“element full potential flow method presented for two-dimensional 
flow by Whitehead and Newton (1985) extended to include quasi- 
three dimensional effects by Cedar and Stow (1985a). 


A transpiration boundary layer model is adopted using an 
integral method based on the method of Luxton and Young (1958) 
for laminar flow and the lag-entrainment method of Green, Weeks 
and Brooman (1973) for turbulent flow. Also included are 
correlations to account for laminar separation and reattachement 
and to handle the start and end of transition. 


For the inviscid flow the continuity equation is 
V.pWh=0 (2.1) 


where, p is the density, W the velocity vector relative to the 
rotating blade and h is the stream-tube height. 


Equation (2.1) is solved using a Galerkin weighted residual 
approach leading to a system of equations of the form, 


z [fo WA.IN da = E fownan as (2.2) 
e(1) e(1) $ 


for each node I, where Ng represents the shape function, S 


represents the perimeter of the domain of integration and 

where the summation is over all elements containing I as a node. 
The right-hand side of equation (2.2) is non-zero only for 
nodes on the blade surface and at inlet and outlet to the 
domain. With the transpiration boundary layer model it can be 
shown that, 


esa 
Banas Ss a [rho Wg 6*] (2.3) 


where Ne and W, are the streamwise and normal components of the 
velocity. 


A velocity potential is introduced and three node straight 
sided triangular elements adopted so that the integrals 
involved in equation (2.2) and the left-hand side of equation 
(2.3) are easily evaluated, see (Whitehead and Newton, 1985) . 
The final system of equations may be written as 


A) =Ç ~B 6* (2.4) 
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where $ is the vector of unknown potentials, A is an N*1 

column vector, N being the total number of nodes, C is formed 
from the boundary conditions (being non-zero) only for nodes 3 
on the inlet and exit. boundaries), ô* is the vector of r 
displacement thicknesses and B is an NNS matrix where Ns is 


the number of surface nodes. 
3. DIRECT MODE COUPLING 


Both B and 6* depend on the surface velocity and with direct 
mode coupling are evaluated using the last cycle values. The 
scheme may be written as 


A gr) =c- war) ™ (3.1) 
for the inviscid flow and 


gx (nt) (n41), Co 


= SW, 


for the boundary layer, where the superscript denotes the 
iteration number; Fig. 1. illustrates the linking adopted. 
The system of equations (3.1) are solved using a Newton-Raphson 
linearization technique which putting 


gO. 7+ v (3.3) 
may be written as 
J. =c- alg - (B68) ™] (3.4) 
where J is the Jacobian matrix 
Jj = = (3.5) 


and can be obtained analytically. The iterative procedure is 
repeated until convergence is achieved. Once a new 
approximation to the blade surface velocities is obtained ô* 

can be calculated using the boundary layer method and the right- 
hand side of equation (3.4) up-dated. In practice under 
relaxation of the procedure is used. 


This system is stable and rapid convergence is found for 
attached boundary layers but in regions where the boundary 
layer is separated or near separation then it either fails to 
converge or requires such heavy under relaxation that it is 
impracticable to use. To avoid these convergence difficulties 
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it is possible to employ 'fixes', such as imposing an upper 
limit on the form factor, but this puts the accuracy of the 
results in doubt. 


m~~ Surface transpiration 
t 
Inviscid calculation 
(Direct mode) 
j 


Surface velocity Us 


t 
Boundary layer calculation 
(Direct mode) 


Boundary layer displacement 
thickness 5* 


EE a TR 
Fig. 1 Direct Mode Boundary Layer Coupling 


4. SEMI-INVERSE MODE COUPLING 


To overcome the problems associated with separation, it is 
necessary to adopt a different procedure in the region where 
the boundary layer is separated or near separation, while 
maintaining the direct mode coupling in the rest of the flow 
field. There are two obvious alternatives which could be 
adopted, these are fully inverse mode coupling as shown in 
Fig. 2, and semi-inverse mode coupling (see Fig. 3). The major 
difference between these two procedures is the treatment of 
the inviscid calculation. 


fier Blade surface velocity U; 


t 
Inviscid calculation 
(Inverse mode) 


Effective blade geometry 


Boundary layer displacement thickness * 


NAS ? 
Boundary layer calculation 
(Inverse mode) 


+ 
Blade surface velocity 


iu r AES aoin 


Fig. 2 Inverse Mode Boundary Layer Coupling 
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Blade surface velocity Velocity at boundary layer edge 
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Adjust ô* 
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Fig. 3 Semi-Inverse Boundary Layer Coupling 


The fully inverse mode coupling requires an inverse 
inviscid calculation. This has already been developed for the 
finite element method, by Cedar and Stow (1985b), in the form 
of an influence matrix, which relates changes in velocity to 
changes in displacement thickness. While, it would be possible 
to implement a fully inverse mode coupling procedure, it would 
be complicated to link it to the direct mode coupling. 


The semi-inverse mode coupling is inherently much simpler 
to link with the direct mode because both procedures use the 
direct inviscid calculation. Due to the relative ease of 
implementation, the semi-inverse mode coupling was developed. 
The form of the linking between the direct mode coupling and 
semi-inverse mode coupling is shown in Fig. 4. The semi- 
Inverse Mode Coupling requires an inverse boundary layer 
calculation, the one used is based on the method presented by 
Lock and Firmin (1981), which basically recasts the equations 
used in the direct boundary layer calculation. Besides the 
inverse boundary layer calculation, the semi-inverse mode 
coupling also requires an updating procedure to complete each 
iterative cycle, as can be seen in Fig. 3. Several updating 
procedures were tried, including those of Le Balleur (1981) and 
Carter (1979), but these were found to be slow to converge. 
The analysis, from which Le Balleur produces his procedure, is 
based upon single aerofoil theory rather than blade-to-blade 
analysis required here. A new procedure was therefore sought 
which would meet the requirements of blade-to-blade schemes. 
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Fig. 4 


The updating procedure is required to estimate a correction 


- 


* 
§ , to the displacement thickness distribution, ô such that 


both the inviscid calculation and the inverse boundary layer 


calculation will generate the same velocity distribution in the 
next iteration. 


First consider the direct inviscid calculation. Suppose 
there is a solution P to A I son (2.4), then a change to the 


displacement thickness a can be related to a change in 
velocity potential $7 by equation (3.4), giving:- 


3) 9° B S (4.1) 


The change in surface velocity, Ww, can be related to the 
change in velocity potential by:- 


= D^ (4.2) 
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Hence 


2S 
" 
& 
I 
w 
© 


S (4.3) 


K is called the influence matrix because it gives the influence 
of displacement thickness on the surface velocity distribution. 
The above analysis is essentially that given by Cedar & Stow 
(1985b) and details are given in their paper on how to calculate 
K. 


Changing notation, equation (4.3) is written as 


gt AT 
aii SiC) (4.4) 


where Wi denotes the velocity distribution calculated from 
the inviscid analysis. 
The inverse boundary layer equations are expressed as 
* 
Wes £(6 ) (4.5) 
where we denotes the velocity distribution calculated from the 


boundary layer equations. From equation (4.5) an approximate 


expression for W can easily be obtained. 


aw, 
Wola (4.6) 
“Vv as” 
aw ~ 
where = is called the boundary layer influence matrix 
aô 


because it gives the influence of a change in velocity on the 
displacement thickness. Given that there is a 'true' velocity 


distribution, Wp then 


WoW +W =W + We (4.7) 


An expression for oF can be obtained by substituting equations 
(4.4) and (4.6) into equation (4.7) to give 


ale aw, -l 2 
6 =[k-- 7] (Wo - Wi) (4.8) 
~ as v I 


Equation (4.8) represents a global correction for the 
displacement thickness. 
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It remains to generate the boundary layer influence matrix. 


* 
Examining equation (4.6), it can be seen that if ô is 
replaced by 


3 element (4.9) 


ot tomon o 
t 


then the elements of the j th colum of the boundary layer 
influence matrix can be written as 


ad gh for all i. (4.10) 


So, by perturbing a member of i and using the resulting 
distribution as input for an iñverse boundary layer calculation, 
a colum of the boundary layer influence matrix can be 
generated. By repeated application of this procedure the full 
matrix can be obtained, the resulting matrix being lower 
triangular in form. 


5. RESULTS 


The semi-inverse mode coupling scheme is being tested 
against experimental results. To illustrate the capability of 
the method, two sets of predictions are given here with the 
corresponding experimental data. The test cases are from a 
compressor cascade from the Compressor Dept., Rolls Royce Ltd., 
the blade section being shown in Fig.5 with a finite element 
mesh. The first case shows a small turbulent separation on 
the suction surface of the blade. For this case the predicted 
boundary layer parameters compare well with the experimental 
results (see Fig. 6a). It should be noted that for this case 
convergence can be obtained using the direct mode coupling, 
with heavy under relaxation in 30 iterations while the semi- 
inverse mode coupling only requires 10 iterations. 
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Fig. 5 


The second case has a pronounced suction surface separation. 
This case will not converge with the direct mode coupling but 
converges in 13 iterations with the semi-inverse mode coupling. 
It can be seen that although boundary layer growth is 
underestimated (see Fig. 7a), the results are encouraging. 


The contraction ratios measured on the cascade were imposed 
as linear variations in streamtube height. This simple 
approximation will account for some of the discrepancies seen 
in the Mach number distributions, Figs. 6b and 7b. 


6. CONCLUSIONS 


The semi-inverse mode coupling has been successfully 
implemented in a finite element blade-to-blade program. The 
global update procedure provides fast convergence, allowing 
the program to be run interactively. The comparisons with 
experimental data show that predictions can now be easily 
achieved for separated turbulent boundary layers. 
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ADAPTIVE CONSTRUCTION OF GRID-SYSTEMS FOR FLOW SIMULATIONS 


L. Fuchs 


(Department of Gasdynamics, The Royal Institute of Technology, 
Stockholm, Sweden) 


1. INTRODUCTION 


The accuracy of numerical solutions to differential equa- 
tions modelling flows depends on physical parameters (e.g. 
Mach number, Reynolds number), the geometrical shape and 
numerical factors (local mesh spacing, type of discretization 
and order of approximation). These factors are inter-related 
and only in very simple cases can one determine the effects of 
each factor a priori. A basic parameter in all cases is the 
ratio of the numerical length scale (i.e. the local mesh 
spacing) and the local physical length scale. The last para- 
meter may vary between just a few mean-free-paths (i.e. the 
shock thickness) to as much as ten orders of magnitude larger 
scales (near free-stream conditions). Generally, a priori 
grid generation techniques will lead to either inaccurate 
numerical solutions due to the coarseness of the grid, or to 
a waste of computational resources without any gain in 
accuracy. Thus, in most cases one has to use some adaptive 
technique for solving numerically, non-trivial problems. 


Sophisticated methods for 1-D (steady and time-dependent) 
problems have been proposed by Pereyra et al. (1975), Chong 
(1978) and Davis et al. (1982). An adaptive grid generation 
procedure for multidimensional problems has been developed by 
Dwyer et al. (1980). This scheme is based upon the placement 
of grid points in proportion to the gradients in the dependent 
variables. Brandt (1977, 1979 and 1984) describes a rather 
general way of using Multi-Grid (MG) methods in an adaptive 
manner. The basic principles are simple but the implementation 
of Brandt's adaptive criteria have been limited to 1-D 
problems. These criteria can adapt the local mesh and the 
local order (of finite-difference approximations) so that the 
numerical resultsare optimal in some sense (e.g. minimal amount 
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of computational effort for a given level of accuracy). The 
generalization of the scheme to higher dimensions makes it much 
more complicated and the total efficiency would depend on the 
proper choice of the residual weighting function. 


The present scheme is based also on the MG algorithm. 
Locally refined grids are introduced whenever the local trun- 
cation errors are 'large'. These local grids may be defined 
completely independently from the grid on which the solution 
is given (approximately), by using some proper local coordinate 
system. In this way the amount of the additional memory 
required to store grid data is minimum, The criteria for 
determining the location of the local grids and the stage when 
the process should be stopped are kept as simple as possible, 
Higher order approximations can be incorporated into the 
scheme, in a simple manner, without any need to modify the 
controlling criteria, The relative error of the numerical 
results can also be estimated. In the following we describe 
the MG scheme with controlled local mesh refinements. The 
application of the scheme for some flow problems is discussed. 


2, MULTI-GRID METHODS WITH LOCAL MESH REFINEMENTS 


Basic MG methods use a sequence of m grids G (1sk<m) where 


Kk! 
each coarse grid is derived from a finer one by doubling the 
mesh spacing. The differential problem, Lu=R is approximated 


by the set of problems Lusky + These discrete problems (e.g. 


defined by some finite differences) are interconnected through 
the following relations: 


SPORET k+l 
Ro Rar 7 Meera) + Fe a (2 


Converged solutions, on different grids, are related through 
_ k+l k+l 
woe Jy Waa? The operator a. denotes a restriction 


operator for the transfer of the dependent variable(s). The 
k+1 

operator I, is a corresponding operator for the residual 

transfer. The use of the dependent variables on coarse grids 

together with (2.1) results in the FAS scheme of Brandt (1977 

and 1984). As pointed out by Brandt, expression (2.1) contains 

also information about the local truncation errors since 


k+l _ k+1 k+1. 


Tk = hay ul ~ Ik Tee 1 (2.2) 
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k+l. X 
where m is proportional to the difference between the trun- 


cation errors of L +l and Le The accuracy of the converged 


solution on coarse grids, Wes is the same as the accuracy of 
the fine grid solution ui In this way the function of the 


fine grids may be viewed as providing truncation error correc- 
tions for the coarse grid solvers. In the more conventional 
sense, coarse grids are regarded as means for accelerating the 
iterative process. We use this "dual view" of MG methods both 
for constructing locally refined grids to improve the accuracy 
of relatively (uniform) coarse grid solutions, and for solving 
efficiently the discrete problem. 


Recently (Fuchs, 1985a), we have introduced a local mesh 
refinement technique in the MG solution process, The basic 
scheme has also been extended to include general overlapping 
local grids (Fuchs, 1985b) . The scheme requires (for uniform 
subgrids) the storage of the coordinates of one node point, 
the relative angle of rotation of the local grid, and the 
number of node points in that grid. In all, the scheme 
requires the storage of seven scalars for each sub-grid. To 
describe accurately certain geometries, some of the sub-grids 
may use general, curve-linear coordinates, In such cases, the 
parameters of the node points must be stored only for those 
particular sub-grids. 


3. ADAPTIVE CONSTRUCTION OF GRID-SYSTEMS 


For accurate numerical solutions, with small amount of 
computational effort, we use certain grid and relaxation 
controlling criteria. These criteria are kept as simple as 
possible so that the controlling part of the algorithm 
requires negligible amount of computational work. 


The solution procedure starts on a uniform and relatively 
coarse grid. The grid is refined (by halving the mesh 
globally) and the prob is solved again on the new grid. 

The initial approximation is better compared to an arbitrary 
guess, since the solution on the coarser grid is used to gene- 
rate the new approximation (by interpolation), After repeating 
this global refinement process several times, the grid in 
larger parts of the domain is fine enough, and refinements are 
done only at certain subdomains. Solutions on the composed 
(uniform and locally refined subgrids) grids are computed by 
the same basic MG solver that is used to solve the problem on 
uniform grids. The sequence of solutions u that is computed 


on successively refined grids gives direct information about 
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the relative accuracy of the numerical results, The relative 
error, Eye in the solution uy, is given by: 


e, = l(u* - u )l/lu l; (l<i<m) (3.1) 
i i i = = 


where u* is the (extrapolated) "zero-mesh" solution. This 
relative error estimate is used to terminate the construction 
of more refined grid-systems. That is, when ory (i.e. the 


error on the finest grid-system) is less than some prescribed 
value (say 1%) the solution is accepted and the adaptive 
solution procedure is terminated. 


The mean accuracy of numerical solutions can be improved by 
reducing the local truncation errors. It must be observed 
that successive construction of the numerical solution gives 
information where the errors in the solutions are large. 
However, this information cannot be used to determine where 
the mesh should be refined. This is so since the largest 
errors in the solution and the largest truncation errors do not 
necessarily coincide. The only real measure of the error of 
the approximation is the relative size of the truncation 
errors. On the other hand, the truncation errors cannot be 
used to determine the accuracy levels of numerical solutions. 
Thus, both these measures must be used for controlling the 
adaptive procedure. 


Optimally, the level of the truncation errors on the final 
grid-system is uniform. This goal may be used to define 
regions with 'large' local truncation errors. During the MG 
solution procedure one can estimate [eq. (2.2)] the local 
truncation errors during the transfer of the problem to 
coarser grids. The sequence of local truncation errors, at 
some subregions, reveal when the discrete approximation con- 
verges with the theoretical (asymptotical) rate, This asympto- 
tical convergence rate is achieved only when the local scales 
of the flow field, are being resolved. Therefore, it is 
important to ensure that this is the case indeed (or otherwise 
simulate the subgrid scales by some other model). The regions 
where the grid should be refined is determined by the local 
truncation error, Tis That is, if 


ese Cis, (3.2) 


the grid in that region should be refined. C is a constant of 
order one, and T is the (global) mean of the truncation error. 
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Controlling the local mesh refinements by relation (3.2 
works well in smooth cases, However, many flow problems are 
associated with 'singularities' of different types. Inner, 
incompressible viscous flows, with errors in the outflow 
boundary conditions, are associated with mesh independent, 
non-vanishing truncation errors, near the outflow boundary. 
In the case of transonic flows with shocks, the largest 
‘truncation errors' in the vicinity of the shocks behave as 


-1 
O(h `) (where h is the typical local mesh spacing). In the 
case of driven cavity flows (with viscous incompressible 
fluids), the local truncation errors near the corners close to 


the moving wall behave as o3. In these examples 
(discussed in Sec. 4), and especially in the last case, 
relation (3.2) is not proper for controlling the local mesh. 
In those cases where the truncation error estimates do not 
decrease as the mesh is refined, we use another mesh 
controlling parameter. In these cases we identify the loca- 
tion of the. boundaries of the region that should be refined. 
(Usually, this is a region around the singularity.) We compute 
the mean truncation errors near the boundaries of a sub- 
domain. This truncation error on the grid Gk is denoted by 


th The region (in which the mesh is to be refined) should be 
enlarged when 
É >B EIR (3.3 


where 8 is a constant. Optimally, the ratio of the estimated 
truncation errors would be 2? (where p is the order of the 


discrete approximation on both grids). Thus, B=2 Pisa 
sufficient condition that the local subregion is large enough. 
By numerical experiments we have found that 8 may be consider- 
ably larger. For the transonic cases below, it was found that 
B < 1 is adequate for solutions that are not affected by vari- 
ations in the extent of the locally refined region. 


In all, the simple criteria (3.1)-(3.3) enable us to control 
the process of constructing grids during the solution 
procedure. When the accuracy of the solution (3.1) is not 
used for terminating the refinement process, the intermediate 
grid solutions can be computed by about a single MG cycle. 
Thus, the total amount of computational work to obtain the 
final solution is determined largely by the number of unknowns 


in the finest grid-system. 
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4. COMPUTED EXAMPLES 


The basic adaptive scheme for constructing proper systems 
of grids have been applied both for the computation of incom- 
pressible viscous flows (both in terms of the streamfunction 
and the vorticity, and in terms of the primitive variables) 
and the simulation of potential transonic flows past airfoils. 
Some examples representing both these different flow regimes 
are given in the following. 


The basic MG schemes for the solution of incompressible 
viscous flows are described by Thunell et al (1981) and Fuchs 
(1984). Three-dimensional extensions are given in (Fuchs, 
1985c). The management of local grid refinement is described 
in (Fuchs, 1985a) and the application to multi-zonal grids in 
(Fuchs, 1985b). Here, we study the application of the scheme 
to computation of the flow in a channel with a backward facing 
step, since the problemshave been used in many computations 
(see e.g. Morgan et al (1984)). A typical dilemma in these 
computations is the choice of a proper grid-system. A closer 
study of the different grids that have been used in (Morgan 
et al, 1984) reveal that intuitively one would place a very 
fine grid in the recirculation region, and the grid elsewhere 
is determined, more or less, arbitrarily (depending on the 
method for generating the grid). 


H=0.0313 
Re=250.0 


0.1250 


Re=250.0 


H 


Fig. 1 The streamline pattern in the channel with a backward 
facing step (local mesh refinements). Re=250 
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Fig. 2 The truncation error map of the cases corresponding to 
Fig. 1 


Our computed streamline pattern (Re=250) on a sequence of 
grids (a global grid and three levels of local refinements) is 
shown in Fig. 1. The truncation error maps, on these grids, 
are shown in Fig. 2. On the global (uniform) grid one can 
notice that the largest truncation errors are located close to 
the entrance and near the outflow boundary (and not in the 
recirculating region). The errors located in the outflow 
region cannot be reduced by local mesh refinements. These 
errors are due to the application of free-stream conditions at 
a too short distance. Using other types of boundary conditions 
(see Fuchs et al, 1984) will produce small "truncation 
errors". By local mesh refinements one can reduce the genuine 
truncation errors in a rational and efficient manner (Fig. 2). 


Next, consider the computation of potential transonic flows 
The full details of the MG scheme are given in 
(Gu, 1985). For a lifting supercritical case (£low about a 
NACA-0012 airfoil at free-stream Mach number, M, = 0.80, and 
angle of attack, a = 1°) solutions have been computed on a 
sequence of grids (with local refinements around the airfoil). 
The truncation error levels on a uniform grid and after 2 and 
, respectively, are shown in 


4 steps of local mesh refinements 
Fig. 3. The extent of the truncation errors is reduced, with 


past airfoils. 
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(a) 


Fig. 3a The truncation error map on the finest global grid 
(with 1/h=8, the number of intervals on each side of 
the NACA-0012 airfoil). M=0.80, a=1° 


(b) 


Fig. 3b The same case as in Fig. 3a, after two stepsof local 
mesh refinements around the airfoil (1/h=32) 


Ý 


i 
i 
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() 


e -0.10 
a -1.00 
x —10.00 


a 0.10 
a 1.00 
x 10.00 


Fig. 3c The same case as in Fig. 3b, after two additional 
steps of local mesh refinements around the airfoil 
(1/h=128) 


increased refinement. Three clear zones emerge: leading- 

and trailing edges, and the shock zones. The largest values of 
the truncation errors cannot be reduced by further refinements 
(because of the singularities near stagnation points and 
shocks). In this case relation (3.3) has to be used to produce 
correct criteria for local grid refinements, When ĝ in (3.3) 
is too large, high frequency oscillations appear in the 
solution. These oscillations are detected easily as (large 
amplitude) oscillations in the truncation errors on the local 
grid (see Fig. 4). In such cases, one may increase the extent 


of the local fine grid until (3.3) is satisfied. 
øw —0.10 
a —1.00 
x —10.00 
ø 0.10 
1.00 
x 10.00 


Fig. 4 The same case as in Fig. 3c, using locally refined 


grids in too small regions 
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6. CONCLUDING REMARKS 


Our MG algorithms for the solution of different flow 
problems have been extended so that they could accommodate 
local grids (and variable order of finite difference approxi- 
mations). The controlling of the adaptive grid generation 
process is simple and it requires mainly information that is 
produced during the MG iterative process. By such a grid 
construction, one may use local grids that are uniform. These 
grids have the advantage that they are simple and require least 
computer storage. Furthermore, the convergence of the itera- 
tive (MG) scheme is fastest (in most cases) on such grids. 
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EPIC - BEYOND THE ULTIMATE DIFFERENCE SCHEME 


J.W. Eastwood and W. Arter 
(Culham Laboratory (Euratom/UKAEA 
Fusion Association), Abingdon, Oxford) 


l. INTRODUCTION 


EPIC - Extended or Ephemeral Particle-in-Cell schemes 
combine ideas of particle-mesh methods and finite element 
methods to give numerical schemes which outperform state-of- 
the-art finite difference schemes in both cost and quality. 


Finite element and particle-mesh methods were initially 
combined in an elegant Hamiltonian formulation by Lewis (1970; 
Hockney and Eastwood 1981). In multidimensional phase-fluids, 
integration is performed most economically using monte-carlo 
techniques, in which sampling points are commonly interpreted 
as particles. In (magneto-) fluid dynamics, the need to use 
particles as the principal carriers of information becomes 
less compelling, in which case the particles can become 
ephemeral, as in EPIC (Eastwood 1982). In 1-D, or with 
fractional timestepping, integrals become easy to do exactly, 
in which case particles become merely conceptual devices and 
EPIC schemes appear similar to the characteristic finite 
element methods (Labadie et al. 1981: Morton 1982: Pironneau 


1982). 

In present-day fusion research, devices such as tokamaks 
operate with small diffusivities, yet resistive instabilities 
can lead to the convective redistribution of heat on short 
timescales. Strong magnetic fields cause large anisotropy 
of heat and particle flows. Changing field topology and, in 
3-D, ergodic flux-lines make field-aligned meshes impractical. 


We believe EPIC offers an effective solution to these problems: 
anisotropies are well handled by f 


inite elements, and the 
almost-hyperbolic limit favours our schemes which treat advec- 
tion accurately without instability. 
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2. EPIC 
2.1 Lagrangian description 

For low dissipation systems it is natural to think in terms 
of motions of parcels of fluid, We therefore follow the ideas 
of particle-mesh methods and transform the governing (almost-) 


hyperbolic equations into co-ordinate mappings. 


We recall that if a parcel of fluid initially at Xo moves 


to x at later time t, so that 


z = x(x,t), (2.1.1) 


then the infinitesimal volume at, at Xo is mapped to 


at = [olat (2.1.2) 
where 


ox | 


(2.1.3) 


Pu D Suc 
dy ER SE ij" t=0 E 


D is the displacement gradient matrix, and |p| is its deter- 


minant. Similarly, the transformation of a vector element of 
surface area aS5 > dS is described by 


-1 
ds = ds .D |p|. (2.1.4) 
It follows that the density p satisfies 
-l 
= |D 
erea hEn (2.1.5) 
in the absence of source terms, and the magnetic field B obeys 


2 = [ol D.B 


X10) (2.1.6) 
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in the absence of resistivity. Sources s add to e.g. (2.1.5) 
the term 


-1 
[o1 f$ Iols att, (2.1.7) 


which we neglect in order to keep the exposition simple, but 
which can be calculated using standard numerical techniques. 


2.2 Discretisation 


We project (2.1.5) onto the set of basis functions 
{W (x,t) } to give the equations for the amplitudes {p} in 


form:- 


fw, [6 - pglDl ‘Jax =- fu, ear =0, (2.2.1) 


where p is the trial function approximation to p, i.e. 


p=p +e, p= Pabor dog? are trial functions and the summation 


convention is used. Depending on the choice of time-dependence 
of (wd, (2.2.1) describes (i) a Lagrangian finite element 


method, (ii) an Eulerian EPIC scheme or (iii) a moving mesh 
EPIC scheme. 


For (i) {wW} satisfies W,(x) = W, (žo) and (2.2.1) becomes 


JW, (20) by (26) 4) Bg = Wy (Zo) P (žo) AT (2.2.2) 
while for (ii) ow, /3t = 0 so 
(Jw, (2) (wat), = JM, e (Ko) at. (2.2.3) 
(2.2.2) and (2,2.3) may each be written as 
(2,2.4) 


Akabe = Tk 
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which is to be solved for unknowns {p,}. m, may be interpreted 


as the mass associated with node k. For (i) node masses are 
fixed and the mass matrix Akg changes at each timestep (because 


the geometry and connectivity of the elements vary), vice versa 
for (ii), while both Arg and me change in (iii): (iii) 


corresponds to adaptive elements which e.g. align themselves 
according to the direction of the magnet field locally (Eastwood 
and Lee-Hsaio 1983). In each case, summing over k shows the 
scheme is conservative. 


For the remainder of the paper we shall assume the Galerkin 
approximation i.e. {w} and thp? are the same functions, 


(2.2.1) then minimises the mean-square error in p, and in 
addition nonlinear instability is suppressed because positive 
definite quadratic quantities can be shown to be non- 
increasing. We specialise to case (ii) since studying 
arbitrarily moving elements introduces complications but no 
fundamental differences. 


The lowest order elements which conform for both advection 
and diffusion terms are piecewise linear. In 1-D, elements 
of unit width are given by: 


W(x) = W(x), W(x) = J1-[x| > [x] <2 (2.2.5) 
fe) ; otherwise, 


The right-hand-side of (2.2.3) becomes 


m = foi, (x+E)W, (20) ax Pont (2.2.6) 


where € is the displacement of point Xo in the time interval 


(t,t+At). Representing & by piecewise linear functions 

reduces the integrand in (2.2.6) toa piecewise cubic; (2.2.6) 
is given by the overlap of two "triangle" functions (Fig. 1). 
The mass matrix on the left-hand-side of (2.2.3) is tridiagonal 
with stencil (1/6, 2/3, 1/6), so (2.2.3) is easily inverted to 
give p at time t+At. This gives the timestep loop: 
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i 
| 


Ey 0 k-ké 1 xl 
ae 3 kL tën 


Fig. 1 Contributions to AL, (8) are given by the overlap 
integral of Wy (x9) with the displaced and deformed 
triangle function Wy (xt) . 


i) t := ttt 
ii) find displacements & 


iii) compute node masses {m} 


iv) solve matrix equation p = Aon to get {o, (t+At) } 


v) go to (i) 
3. LINKS WITH OTHER SCHEMES 


3.1- PCI 


Steps (iii) and (iv) in the above timestep loop are like 
flux-corrected-transport (Boris and Book 1976 and refs. 
therein) in that m may be interpreted as a low order approxi- 
mation to new densities pọ (t+At) and inverting the mass matrix 
as anti-diffusion. Explicitly, (2.2.4) with linear Eulerian 
elements becomes 


(op atok Ok) /E = Bk’ 
(3.1.1) 


1 
i.e. Pk Set A (p41 2k k- y 
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which has the form of a diffusion equation with coefficient 
-1/6. 


3.2 Godunov 


The "Lagrange then remap" procedure has been shown to be 
effective in finite difference schemes for compressible flows 
(van Leer 1979). We remark that the right-hand-side of 
(2.2.6) amounts to a mapping of a density distribution 
obtained from a Lagrangian procedure back onto a fixed mesh 
(cf. Fig. 1). 


Fede PLO, 


The right-hand-side of (2.2.6) is the "charge-assignment" 
operation in the jargon of particle-mesh methods. Such methods 
are concerned with multidimensional calculations where the 
integral is hard to evaluate analytically, so quadrature is 
used instead. Letting ôm; = P (Xoi) Toi be the mass associated 


with quadrature point i, the troublesome integral is approxi- 
mated by 


m “Ea Cos + E)ôm, (3.3.1) 


This may be interpreted as: take particle i at Xoi”! move it to 


x, = Xi tS and assign mass to node k according to the assign- 


ment function Wk . 


The diffusive nature of conventional fluid PIC follows 
because in effect it lumps the mass matrix. However, when 
terms of the form (2.1.7) are treated by one- or two-point 
quadrature at each timestep, it is natural to use the 
advection/diffusion splitting of PIC, and if nodal values are 
regarded as principal variables, particles can be interpreted 
simply as quadrature points for awkward integrals. 


4. EPIC COMPARED TO OTHER SCHEMES 


4.1 Linear analysis 


Linear analysis gives the variation with wavenumber of 
numerical dispersion and dissipation. The best schemes 
generally have small damping for wavenumbers with small phase 
errors, and large dissipation where phase errors are large. 


EPIC - BEYOND THE ULTIMATE DIFFERENCE SCHEME 587 


Applying the usual procedure to EPIC for a problem with 
uniform advection (€ = constant) and node spacing, we derive 
first the linearised equation, a convolution, 


T(p-a) pg (t+at) =T (p-q-E) ogl), (4.1.1) 


where for triangle functions 


Itz) = | (4-2z°[2-]z|])/6 ; |zls1 
2-|z|)3 ; 
(2-|2|) 3/6 ; 1s|z|<2 Cake oy 
fo} ; otherwise 


Fourier transforming (4.1.1) gives the dispersion relation 


a = ebt = ¥(K,E) /E(k,0) » (4.1.3) 


which is to be compared with the exact result 


wht = ké. (4.1.4) 


Linear stability is assured since A lies in the unit circle 
for all &. 


The dispersion of EPIC with linear basis functions is 
compared with that of Lax-Wendroff and upwinding in Fig, 2(a) 
and with minimum phase error /FCT and the high order Godunov 
scheme PPM (Colella and Woodward 1984) in Fig. 2(b). All 
curves are drawn for a Courant number C = 0.25: larger timesteps 
are more flattering to EPIC, but we have to choose a value for 


which the other schemes are stable. 


Lax-Wendroff has large phase errors and small damping so we 
expect (and find cf. Section 4.2) spurious wiggles and 
incorrect propagation speeds. Upwind has smaller phase errors 
but much larger dissipation. FCT has about the same phase 
errors as EPIC, but greater damping at shorter wavelengths 
and is weakly unstable at long wavelengths: PPM has slightly 
larger phase errors and dissipation intermediate between EPIC 
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Fig. 2b 


(a) and (b) Real (phase) and imaginary (damping) parts of the 


frequency plotted against wavenumber for uniform advection. 
The curves are identified in the legends. 
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and FCT. For wavelengths greater than 4 node spacings 
(wavenumbers < 0.5), Fig. 2(b) shows there is little to choose 
between EPIC, FCT and PPM on the basis of linear analysis. 


4.2 Kinematic test problem 


We have developed a test problem based on the 1-D continuity 
equation to compare schemes. A gaussian hump is transported 
through a unit length periodic box by a velocity field u(x) 
that provides both compression and expansion. We take 
u = 2-sin2m™x, N = 100 nodes in a spatial period, C = 0.25, anda 
gaussian density hump placed initially at the velocity maximum 


(p (t=0) = expl- [x-1/4]7/0*) , o = 0.1). 


The compression reduces the effective o of the gaussian to 
approximately 3 node spacings at smallest (proportionately 
increasing its height to conserve total mass). The compressed 
p Fourier transforms to a gaussian, the amplitude of which is 


< 2x109 of peak value for wavelengths less than two mesh- 
spacings, i.e. p can always be resolved to within round-off 
error (of 4-byte arithmetic) by the chosen mesh, Specifying 
u(x) avoids the special case of uniform advection while 
eliminating the velocity field as a source of error. Thus our 
test problem should provide a sensitive measure of the perfor- 
mance of a scheme. 


Fig. 3 shows the outcome for the five schemes of Section 
4,1. The broken curves are the initial conditions and the 
solid ones the result after transport through a distance of 
3 periods (about 2000 steps); broken and solid should be 
identical in the absence of error. Fig. 3(a) and 3(b) show 
that Lax-Wendroff and (conservative) upwinding perform badly, 
as predicted in Section 4.1. Fig. 3(c) illustrates the profile 
steepening that is characteristic of FCT. From Fig. 3(d) PPM 
has more accurate slopes than FCT, but Fig. 3(e) reveals that 
EPIC clearly gives the best approximation. The cpu times 
taken were 0.21 for (a), 0.26 for (b), 0.44 for (c) and 
1.39 for (e), all measured relative to EPIC. Thus EPIC is of 
comparable speed to the schemes it outperforms. 


4,3 Nonlinear test problem 


We apply the linear EPIC scheme to Burgers' equation 


av OM eee (4.3.1) 
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Fig. 3 Initial (dashed curve) and final (solid curve) profiles 
of a gaussian convected by a sinusoidally varying velo- 
city field as described in Section 4.2, for (a) Lax- 
Wendroff, (b) upwind, (c) FCT, (d) PPM and (e) linear 
basis function EPIC schemes. 


EPIC - BEYOND THE ULTIMATE DIFFERENCE SCHEME 591 


(b) 


Fig. 4 Sequences showing overall ređuction in amplitude as 
time increases of the solutions to Burgers' equation 
described in Section 4,3. Linear basis function EPIC 
is used with predictor — = våt; C = 0.5, N = 100, and the 
curves are drawn at intervals of loodt. 


R = (a) 10, (b) 10°, (c) 104 and (d) 10°. 
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on the unit periodic interval with initial conditions 
v = - sin2tmx. €& is evaluated as v(t)At, otherwise the 
procedure of Section 4.2 is copied. 


For large Reynolds number R, the sinusoid rapidly becomes 
a "sawtooth", then decays on the resistive timescale. The 
significant spatial harmonic of highest frequency in the 
problem has wavenumber 0.75R. Fig. 4(a) shows that when R=100, 
solutions v are indeed accurately resolved and smooth. At 
higher R wiggles develop, but note that unlike many finite 
difference schemes which are nonlinearly unstable at such R, 
due to aliasing error (Eastwood and Arter 1985), EPIC remains 
stable. The appearance of oscillations is the Gibbs 
phenomenon, caused by neglect of significant sub-grid-scale 
harmonic content. Appropriate, sub-grid-scale physics models 
can be used with EPIC (as with any other scheme) to remove 
these wiggles. 


5. CONCLUSIONS 


We have demonstrated that EPIC outperforms state-of-the-art 
finite difference schemes such as FCT and PPM for about the 
same computational costs, EPIC schemes are consistent, give 
unconditionally stable kinematics, and the Galerkin variants 
can be shown to be free from nonlinear instabilities, Optimal 
accuracy arises because EPIC causes errors in the principal 
variables to be minimised. 


The key ideas behind FCT, PPM and PIC methods have been 
shown to be embraced by the EPIC formalism, whilst knowledge 
of particle-mesh techniques provides insight into the best 
ways of tackling tricky overlap integrals that appear in EPIC. 
The finite element formulation used also allows a variety of 
element connectivities and geometries to be employed (although 
the flexibility has to be paid for in increased computational 
costs associated with indirect addressing, element geometry, 
etc.). We expect EPIC to be useful over the whole gamut of 
fluid and MHD problems. 
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GRID GENERATION AND FLOW CALCULATIONS 
FOR COMPLEX AERODYNAMIC SHAPES 


N.P. Weatherill, J.A. Shaw and C.R. Forsey 
(Aircraft Research Association Ltd, Bedford) 


l. INTRODUCTION 


The generation of coordinate systems about complex 
configurations is generally recognised as a pacing item for 
computational aerodynamics. Although the literature on grid 
generation is now extensive, few methods have been shown to 
be applicable to general three-dimensional configurations. In 
addition, grid generation has too frequently been viewed as an 
isolated topic with little attention being given to the hosted 
flow algorithm. In this paper we describe a multi-block method 
of generating body conforming grids for complex three- 
dimensional configurations and give flow calculation results 
for realistic aircraft using a finite volume flow algorithm 
for the Euler equations. 


2. MULTI-BLOCK - GENERAL PRINCIPLES 


The philosophy we have adopted for grid generation has been 
detailed previously (Weatherill and Forsey, 1984). The whole 
flow field, between the surfaces of the configuration and some 
outer farfield boundary, is broken down into a set of blocks. 
The union of these blocks fills the entire flow field without 
either holes or overlaps. Each block is chosen to be 
topologically equivalent to a cuboid, in that it has six faces 
and eight vertices, and can be mapped into a unit cube in 
computational space. The mapping between computational and 
physical spaces is performed using the elliptic grid generation 
approach of Thompson et al (1974). A user constructed topology 
file specifies the arrangment of blocks and thereby defines the 
topological structure of the grid. To simplify the internal 
program logic, a single boundary condition is imposed on each 
face of each block. 
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Block boundaries which are part of the configuration or 
outer-boundary have the grid specified on them a priori by a 
surface grid generator. They are referred to as Dirichlet 
block boundaries. Block boundaries, which are not part of the 
configuration or outer boundary, are purely notional, in that 
they have no physical significance, their position in space 
evolving in a similar manner to any point interior to a block. 
They are referred to as continuity block boundaries. On such 
boundaries, internal program logic, driven by the topology 
file which contains the relevant information about adjacent 
block number, face number and relative orientations of 
adjacent faces, is used to access the coordinates of points in 
neighbouring blocks required by the solution procedure. Across 
these block boundaries, the grid is as smooth as at any point 
inside a block. 


3. SURFACE GRIDS 


An intrinsic part of our approach to surface grid 
generation is a geometry package based on bi-cubic surface 
patches. The geometry of each component is input in terms of 
cross-sections and converted internally to a continuous 
function of two parametric coordinates S = S(s,t) using a Coons 
patch formulation (Coons, 1967). This transformation from a 
discrete to a continuous description of the surface enables 
component intersections to be calculated using a Newton-Raphson 
routine. The resulting parametric coordinates of the 
intersection lines are then used as fixed Dirichlet data for 
the surface grid generation. 


The surface grids, whose topological structures are derived 
from the field grid topology information, are generated as 
solutions of the two-dimensional elliptic equations of Thompson 
et al (1974), expressed in parametric coordinates S = S(s,t) 

ti ery ait das 
Si = -p Soi 3.1 
i g Ẹ gl P E ( ) 
J 


ae 
where g are the metric terms, oe the control functions and 


zt the computational coordinates with i,j summed over 1 and 2 
in the usual tensor fashion. The non-linear algebraic 
equations resulting from the discretisation of equations (3.1) 
are solved using a successive point over-relaxation iterative 
scheme, with all points in a given block being updated before 
advancing to the next block. 


The solution values are mapped back to physical space via 
the Coons patches, which ensures mathematical consistency with 
the geometry definition. They are then used as fixed Dirichlet 
data for the field grid generation. 
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4, FIELD GRIDS 


The positions of points in the field are determined as the 
solutions of the three-dimensional elliptic grid generation 
equations which are equivalent to (3.1) with S = S(s,t) 
replaced by X = X(x,y,z) and the tensor summation for i,j taken 
over the values, 1, 2 and 3. In a similar manner to the 
surface grid technique, the equations are solved iteratively 
with a point over-relaxation scheme using block-by-block update. 


Inherent in our multi-block approach is the acceptance of 
singular points and singular lines where the Jacobian of the 
transformation is zero. In physical space such singularities 
manifest themselves as points in two dimensions at which other 
than 4 lines meet or in three dimensions where other than 6 
lines intersect. These points are identified automatically by 
analysing the topology file and their position determined as 
the average of the points surrounding them. 


For the field grid generator, as for the surface grid 
generator and the Euler flow code, indirect addressing, which 
we call a pointer scheme, is used to access variables from 
adjacent blocks required by the solution scheme on continuity 
boundaries. For a given topology, this offers a considerable 
saving in computer storage compared with a halo type scheme 
which requires all variables from adjacent blocks needed by the 
solution scheme to be stored in a halo around each current 
block. In this way, the halo scheme effectively increases all 
block dimensions by two for the grid generators and four 
for the Euler flow code. The drawback with a pointer scheme 
is that it is more difficult to take advantage of the 
vectorisation capabilities of modern computers, Nevertheless, 
for the field grid generator we have been successful in 
vectorising such a scheme and have reduced run times by a 
factor of five compared with an unvectorised halo scheme. 


The field grid generation has highlighted three particular 
problems: (i) the difficulty in viewing and presenting three- 
dimensional grids, which indicates the need for very 
sophisticated interactive graphics software, (ii) the grid cross- 
over property, exhibited by the discrete form of Thompson's 
equations, which occurs particularly in 3D around sharp edges 
such as wing tips, and (iii) the satisfaction of a particular 


convergence criterion. 


To overcome problems of grid cross-over, which are 
particularly apparent in the more complex topologies we have 
examined, we have fixed additional block boundaries in the 
field. These boundaries are chosen to produce surfaces of 
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approximately constant x, y or z. The grids on these surfaces 
are generated using the surface grid generator for two of the 
variables, x and y, say, and the approximately constant z 
values found by solving a two-dimensional Laplacian equation in 
Z. Fixing such boundaries has the additional benefit of 
improving convergence and provides an effective technique for 
grid control. A drawback of this approach would be if the 
flow algorithm was sensitive to slight discontinuities in the 
slope of the grid and illustrates the importance of 
considering the hosted algorithm in the grid generation 
process. 


5. FLOW ALGORITHM 


The flow algorithms now available for solving the Euler 
equations combine satisfactory accuracy with acceptable 
convergence rates, making the prediction of transonic flow 
over a complete aircraft a realistic proposition. A finite 
volume explicit Runge-Kutta time stepping scheme based on the 
work of Jameson et al (1981) has been coded to work within a 
multi-block framework. As with the grid generator, all 
variables are stored in a one-dimensional array and a pointer 
scheme is used to balance fluxes across continuity block 
boundaries. Again, the topology file gives information about 
the boundary conditions for the flow code on each block face. 


An important aspect of our Euler multi-block work is the 
block-by-block update of all flow variables. In a single 
block grid, all variables in the field are updated to an 
intermediate time level before advancing to the next stage in 
the Runge-Kutta scheme. However, in the multi-block scheme, 
the flow variables are updated to the new time level block-by- 
block to save on computer storage. Consequently, on a 
continuity block boundary, the variables required from an 
adjacent block could either be frozen at the old time level or 
have already been advanced to the new time level. This mixing 
of new and old values could impair convergence rate as the 
number of blocks is increased. However, our experience 
suggests that convergence rate is only slightly degraded. 


No special formulation of this cell centred Euler algorithm 
is required at singular lines and points and limited 
investigations have indicated that the code is insensitive to 
such features. 
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6. RESULTS 


We have used the three main types of mapping, O-grids, 
C-grids and H-grids, as the basis for constructing various 
grid topologies around civil and combat aircraft. Each mapping 
has its attractions and limitations. O-grids are 
computationally efficient, requiring fewer grid points to fill 
the same volume, but topologically very difficult to construct 
around all components. C-grids are less efficient than O-grids 
but offer an improved grid structure around areas such as wing 
trailing edges and wakes. H-grids are computationally 
inefficient, their appeal lying in the simplicity of adding 
extra components to a configuration by opening up a slit in 
a cartesian-like grid. 


At present, both the field grid generator and the Euler code 
run in core on a 1 megaword Cray 1S and hence are restricted to 
rather coarse grids. Thus, the following examples are more 
illustrative of the types of grid topologies obtainable rather 
than actual, practically usable, grids. In Figs l, 2 and 3 we 
show three different grid topologies which have been 
constructed for the RAE wing A body B2 test case (Treadgold 
et al, 1979). In Fig. 1 we have a C-grid in the chordwise 
direction and an H-grid in the spanwise direction. The 
topology is made up of 24 blocks and there are 34,000 points 
in the field. 
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Fig. 2 shows an eight block grid with 17,000 field points; 
the chordwise and spanwise grids are both O-grid structures. 
In Fig. 3 we have chosen an O-grid in the spanwise direction 
and a C-grid in the chordwise direction. This topology is 
comprised of 24 blocks with 32,000 field grid points. 


The Euler flow code was run for 1000 time steps on all these 
grids for a subcritical case, M=0.4,a=0. A comparison 
of the results obtained for this case, along with experimental 
results and a single block full potential result using the 
ARA FP wing/body code with an equivalent number of grid points, 
are shown in Fig. 4. Given the coarseness of the grids, the 
results are quite encouraging. A transonic flow result, 
M=0.9, a9 = o, is shown in Fig. 5. 


+ EXPERIMENT 
:—-— C-0 GRID 
4 —-— 0-0 GRID 
----- C-H GRID 
4 — FP 


jj M=0.4 g=0 Ņ=075 


o 
Fig. 4. Wing A/B2 M=0.4 a=0 
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M=09 a=0" 7=0-75 
+ EXPERIMENT 
n ----- C-H GRID 


04 
Fig. 5. Wing A/B2 

A particular geometry, currently of interest, is a canard/ 
forward swept wing/body configuration where we have chosen to 
have an O-mapping in the chordwise and spanwise direction of 
both lifting surfaces. As a comparison for the flow results, 
we have generated a field grid by spanwise stacking of strea 
streamwise two-dimensional grids generated by the surface grid 
program on a wing/canard configuration. This topology has 
C-grids in the chordwise direction and an H-grid in the 
spanwise direction and is comprised of 54 blocks. Fig 7 shows 
the grid obtained for this topology with no grid control. 
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To provide a mechanism for controlling grid spacing we have 
developed a multi-block version of the grid control technique 
of Thomas and Middlecoff (1980) and implemented it in the 
surface grid generator. The control functions are made 
continuous across block boundaries and automatically modified 
in regions of high curvature to prevent crossover. Fig. 8 
shows the grid generated for the configuration shown in Fig. 7 
when grid control has been implemented. A comparison of the 
flow results obtained on the grids of Figs 6 and 8 for M = 0.7 
and a = 4° are shown in Fig 9 for a station threequarters of 
the way along the wing. 


C-C-H Grid 


Fig. 8. Wing/Body fanard (With Control) 
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Fig. 9. Wing/Body/Canard 


For those not familiar with multi-block grids, the concept 
of visualising 30 blocks mapped in O-O structures is not easy. 
In Figs. 10 and 11 we show schematics of the grids shown 
respectively in Figs. 6 and 8. 


9-0-0 Grid 


Fig. lo. Wing/Body canard (Schematic) 
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C-C-H Grid 


Fig. 1l. Wing/Body/Canard (Schematic) 


It is clear from the above results that the work is still 
very much at an initial stage but that our multi-block approach 
provides a sound basis for grid and flow calculations for complex 
configurations. Finally, for completeness, we show two civil 
aircraft configurations which are currently under study using 
the multi-block approach. Fig. 12 shows a 30 block wing-body- 
tailplane grid and Fig. 13 a 150 block wing-body-nacelle-pylon 
grid. The topology file for this final case, made up of C-grids 
and H-grids, was generated automatically, offering a considerable 
saving over manual specification of the topology file. 
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Fig. 12. Wing/Body /Tailplane 0-0-0 Grid 
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Fig. 13. Wing ody/Pylon/Nacelle Surface Grid 


7. SUMMARY 


A general multi-block grid genération method has been 
described for computing coordinate systems (grids) which are 
aligned with each of the components making up an aircraft. An 
established formulation for solving the Euler equations has 
been structured to accept a multi-block grid and preliminary 
results on various topologies indicate that the methods 
described can form a system which can successfully predict the 
flow around a wide variety of complex aerodynamic shapes. 
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FCT APPLIED TO THE 2--D FINITE ELEMENT SOLUTION OF TRACER 
TRANSPORT BY SINGLE PHASE FLOW IN A POROUS MEDIUM 


A.K. Parrott and M.A. Christie 
(British Petroleum Exploration, London) 


l. INTRODUCTION 


This paper describes a new adaptation of the Flux Corrected 
Transport technique of Boris and Book (1973, 1975), Zalesak 
(1979) to the finite element solution of a 2-D advection - 
diffusion equation. Results for the problem of propagating 
step discontinuities show the method to be monotone and to give 
sharp resolution of fronts. 


The examples derive from the problem of modelling the inter- 
well flow of tracer in water injected into a petroleum 
reservoir. These problems are very similar to those of 
modelling pollutant transport in groundwater, an area in which 
finite element methods have been used for some time (Pinder, 
1975). It is well established that for small dispersion 
coefficients numerical oscillations occur in the standard 
Galerkin formulation and resort is usually made to upstream 
weighting of some kind: see e.g. (Sun and Yeh, 1983). To 
avoid excessive numerical dispersion these methods effectively 
interpolate between full upstream weighting and standard 
Galerkin using a parameter which will in general depend both 
on the mesh variation and the problem type. Specifying this 
Parameter in advance can constitute a drawback for such 
techniques. 


The flux corrected transport technique described below is 
capable of resolving fronts over 3 nodes without any numerical 
oscillation. The FCT algorithm used is that of Zalesak (1979) 
with some minor modifications to extend it to triangular 
meshes and finite element flux terms. It effectively consti- 
tutes a parameter-free non-linear interpolation between full 
upstream weighting and standard Galerkin. 
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2. FORMULATION 
2.1 Concentration Equation 


The following non-reaction chemical transport equation 
provides a simple model of the tracer problem we are interested 
in solving. We have assumed a constant thickness single phase 
reservoir with a number of injection and production wells 
represented as delta functions. i.e. 


dc 
aa V.t(uc) = aY. (D (u) Vc) + Q (yan G D 


for all (x,y) €Q, with boundary condition 


ge. fo) for all (x, y) ef TRE 


where T is the boundary of 2. The well term Q is a sum of 
source and sink terms as follows: 


n Pyne m 
inj 
Q= & qe 6 (x ay Sead Aga Oh Cae sage), (2.1.3 
rel FY =r AaS TLAS 


it A 
where a is the injection concentration at each of the n 
injection wells and cS =C (x t) is the in-situ concentra- 


tion at each of the m production wells. The dispersivity 
tensor has the standard form 


D (u) = (2.1.4) 


where as is the longitudinal dispersion coefficient. 
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2.2 Velocity Field 


The velocity field u is obtained via Darcy's law from the 
pressure field. This is calculated initially and for an 
incompressible single phase system remains constant throughout 
the tracer calculations, providing the rates do not change. 
The pressure equation is solved conventionally using linear 
finite elements on the same mesh as the concentration equation 
and using local singularity subtraction at the wells as 
described in (Spivak et al., 1977). The singularity subtrac- 
tion is facilitated by the use of a mesh generator which auto- 
matically matches node positions to well positions. The 
approximate pressure field has the form 


N mn ag [x = x] 
P(x,y) = ÈE Pi, (x,y) + I hogs (x,y) 
sl s1 s Rs IE 
(2.2.1) 


where 95 (s) is the linear basis function for the nodes coinci- 
dent with the position of well s, RS is the maximum extent of 
95 (5) and Ks is the permeability at well s. The velocity 
field is then given simply by 


N m+n q 
u=KVp=K £ P, V, +K I Vv (1 - R ; ) 
= MAS jel que an anK, wes og (Lx x] al ig 
(2,2.2) 
a, + By (2.2.3) 


where Uy is non-zero only for those elements having a well at 


a vertex. 
2.3 Upstream Weighted Finite Elements 


In this section a "low-order" finite element approximation 
to the concentration equation is derived which uses mass 
lumping and a convenient form of upstream weighting. Since 
high Peclet number flows are of most interest the physical 
dispersion coefficient is set to zero from hereon and the 
limiting hyperbolic case considered. 
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The mass lumping factors a, are obtained by summing 1/3 


of the area of each triangle with node j as a vertex. Equation 
(2.1.1) is then approximated by 


n+l n. 
Mea = <u C°, VØ > +< 
ej ) Liei v6, u 


: ctl vø 


+1 
r > + SOn > yg, > 


j 


for j=l,...,N (2.3.1) 


where c is the nodal value of the concentration at time 
t = nåt and the L, inner products <u C, w are integrated 


in the following upstream sense. Consider the element flow 
integral, 


e 


$j 


) C dx dy (2:322) 


= f ug 
aa = a) 


on an element with node j as a vertex, where Ta is the element 


area. An upstream approximation to £ is then 


j 
£2) UPS _ (o8, ups 
(#5) (c}) oJ u (VØ) dx dy (2.3.3) 
= Co S (2.3.4) 


where the upstream concentration (c$) UPS with respect to node 
j is given by j 


c; for vzo 
(cê) UPS t 
? (233.5) 
PIINE e 
CRE e for Yj <0 
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om = max (O, vD , and the nodes of the element are denoted by 


i,j,k. The quantity S is simply the flow vector u resolved 


perpendicular to the side opposite node j. 


These upstream element flow integrals are assembled in the 
standard manner to give an upstream evaluation of the flow 
terms in equation (2.3.1), i.e. if 


FY (u,c) = £ (£°)5PS (2.3.6) 
i= a J 


for all the elements having node j as a vertex, then (2.3.1) 
can be written as 


n L n L n+l n+l 
a (cj - c) UC WE EY (age Co) + <Q 5 W> 


for j=1,...,N (2437) 


The solution of this equation is monotone provided that the 
timestep is kept within the appropriate CFL limit. It is 
however very dispersive as can be seen in Fig. l. The 
implicit terms in (2.3.7) are easily dealt with since ug is 


radial around wells; no matrix inversion is required. 
2.4 A Finite Element FCT Algorithm 


Equation (2.3.7) contains three sources of inaccuracy, 
namely the upstream weighted flow terms, the first-order time 
integration used, and the mass lumping. These inaccuracies 
can be corrected in a monotonicity preserving fashion using 
Zalesak's FCT algorithm providing they can be expressed as 
explicit conservative fluxes. We have only corrected the 
first two inaccuracies: however Donea (1984) describes an 
explicit correction to mass lumping which is conservative 
and could easily be incorporated in what follows. The high- 
order fluxes used to correct (2.3.7) are taken as 


j=1,...,N (2.4.1) 
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where only the B component of the flow field is used in the 
corrections as the Up component would involve implicit terms. 


The second term is a Lax-Wendroff correction term which con- 
veniently depends on the physical dispersion tensor defined in 
(2.1.4). 


The element flux limiting algorithm is a straightforward 
adaptation of the FCT algorithm, as follows 
(i) compute the upstream weighted cP = Git: from equation 
(2.3.00) 


(ii) compute the high-order fluxes from equation (2.4.1) and 
then calculate the anti-diffusive flux corrections 


L n e 
= - = 2.4.2 
aj Fy upa Ca 5 peo) Z aj ( ) 


(iii) limit the individual element flux corrections 


a’ = 8 as o< B? 


(iv) correct the upstream concentrations at t=(n+1)At using 


EL up) _ 2.4.3 
(cy creya ( ) 


Clearly for 8 = 1 in step (iii) the usual oscillatory solution 
would be obtained and so the selection of limiters is crucial 
to the properties of FCT. The choice of limiter for each 
element is based on the multidimensional FCT limiter of 
Zalesak which is restated here for convenience with minor 
adaptations for triangular elements. 
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For each node j define the following 


pt = E max (O, ay) for each element having node j as a vertex, 
j 
gf =a, (E - oP) sat, (2.4.4) 
J j J j 
+ + 
min (1, 9f/P}) P >o 
i! AS, J 
R, = 
j pt 20 
2 j 
and similarly 
= 4 e 
Bi = -E min (0, a5) ' 
= up in. 
7 = 8. (CLP - EN sat (2.4.5) 
Qj 5 ( j j / , 
in (1, Q./P,) PESO 
i Bia (Ly 95/2; 7 


R, = 
J 
o 


$ 
The RA and He terms are then sorted for each node of a given 


element into element values RS where 


e 
R; = 2.4.6 
j ( ) 


One choice of element limiter g? is then 


Creve! e e e 
B` = min [R,, Ry Rg] (2.4.7) 


where i, j, k are the element nodes. 
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It remains to specify the maximum and minimum values 
cen eg for each node: again following Zalesak we put 


DX max (cP; cP, oP, L (2.4.8) 
J Y j 3 3" 


and similarly for ri 


connected to node j in the triangulation, Values from the 
D) can also be included into (2.4.7). 


j 


, where j, j" etc. run over all nodes 


previous timestep (i.e. C 


3. DISCUSSION 


Figs. 1 and 2 show the propagation of a bank of 500 ppm 
tracer from left to right across a symmetry element of a 
repeated 5-spot injection/production well pattern. ` The 
contour plots show the FCT finite element solution and the 
well-to-well profiles compare the FCT solution with the 
upstream scheme (8=0) and the Galerkin Scheme (f=1). The 
timestep used is the same for each case, giving a maximum 
Courant number of 0.7 away from the wells. It can be seen 
that the Galerkin solution is highly oscillatory and, although 
adding a Lapidus-type artificial viscosity damps this 
oscillation, it is not effective at breakthrough and gave 
severe stability problems for this problem. The FCT solution 
is precisely monotone and in the lower part of the grid has 
resolved the fronts over three nodes. 


This technique has been applied to multiwell tracer 
injection in irregularly bounded reservoirs and has behaved 
consistently well. Future work will involve testing on the 
two-phase flow equations where FCT has proved successful with 
conventional finite difference techniques (see Christie and 
Bond, 1985), and examining different types of upstream 
weighting and higher-order flux correction terms. 


4. ACKNOWLEDGEMENTS 


Dr. I.R. White is thanked for his contribution to the 
ALCHEMIST program which was the basis for obtaining these 
results, Dr. A. Settari for many valuable discussions, and 
British Petroleum Ple for permission to publish this work. 


2-D FINITE ELEMENT FCT FOR POROUS MEDIA FLOW 619 


5. REFERENCES 


Book, D.L., Boris, J.P. and Hain, K.H. (1975) Flux Corrected 
Transport II: Generalisation of the method. J. Comput. 
Phys., 18, 248-283. 


Boris, J.P. and Book, D.L. (1973) Flux Corrected Transport I: 
Shasta, a fluid transport algorithm that works. J. Comput. 
Phys., 11, 38-69. 


Christie, M.A. and Bond, D.J. (1985) Multi-dimensional Flux 
Corrected Transport for reservoir simulation. SPE13505 
Eighth SPE Symposium on Reservoir Simulation, Dallas. 


Donea, J. (1984) A Taylor-Galerkin Method for Convective 
Transport Problems. Int, J. Num, Meth. Eng., 20, 104-119. 


Gray, W.G. and Pinder, G.F. (1976) An analysis of the 
numerical solution of the transport equation. Water Res. 
Research, 12, 547-555. 


Spivak, A., Price, H.S. and Settari, A. (1976) Solution of 
the Equations for Multidimensional Two-Phase Flow by 
Variational Methods. SPE5123 4th SPE Symposium on 
Reservoir Simulation, Los Angeles. 


Sun, N.-Z. and Yeh, W.W.G. (1983) A proposed upstream weight 
numerical method for simulating pollutant transport in 
groundwater". Wat. Res. Research, 19, 1489-1500. 


Zalesak, S.T. (1979) Fully multidimensional Flux-Corrected 
Transport algorithm for fluids. J. Comput. Phys., 31, 
335-362. 


ye STEAL Pcie 
Rit 


i 


ah 


likikan Ai 
By ci 
MERCAN 


ash RAE 
BRAN, Wrest 


8p 
7 x aN 


to 


NUMERICAL MODELLING OF DISCONTINUOUS ATMOSPHERIC FLOWS 


M.J.P. Cullen 
(Meteorological Office, Bracknell, U.K.) 


l. INTRODUCTION 


Numerical modelling of large scale atmospheric flow is very 
successful. Useful forecasts of the flow pattern are obtained 
out to 3 days in nearly every case and to 5 days in about half 
the cases. The problems occur in converting these forecasts 
into useful statements about actual weather. In order to do 
this the detailed vertical structure of the atmosphere must 
be accurately predicted so that different air masses can be 
clearly identified. 


A large scale weather map, such as those published in the 
newspapers, appears smooth. The dynamical theory of vertically 
averaged atmospheric motion shows that the behaviour is 
essentially the same as two dimensional incompressible flow. 
That system of equations, given smooth initial data, has smooth 
solutions indefinitely: see (Kato, 1965). A correct numerical 
solution can be obtained by any stable consistent finite 
difference or Galerkin scheme. The most effective way of 
obtaining nonlinear stability in this case is to use the 
conservative schemes of Arakawa (1966), or a Galerkin method. 
The concept of smooth solutions extends to three dimensional 
flow provided that the scale remains large and the 
stratification strong. However, detailed observations of the 
vertical structure of the atmosphere indicate that the 
solutions are far from smooth. The structure is often like a 
series of layers of air with markedly different properties and 
sharp interfaces between them: see (Danielson, 1959). These 
interfaces support waves and turbulence. The amount of 
moisture and dust in the different layers is usually very 
different, leading to marked changes in weather when different 
layers reach the surface. 


622 CULLEN 


A system of equations whose scaling allows for the presence 
of discontinuities has been developed by Hoskins (1982) and 
the’existence of solutions for piecewise constant initial data 
shown by Cullen and Purser (1984). There is a need to solve 
the complete equations of atmospheric motion in a way which 
will generate these piecewise constant solutions and model the 


waves and turbulence on the interfaces. As in other branches 
of fluid dynamics when discontinuities develop, it is 
essential to choose the correct conservation law form of the 
equations if numerical solutions are not to converge to 
unphysical solutions. It is also necessary to refine the 
solution procedure to help it to capture the discontinuities. 


In this paper this is illustrated for three situations. In 
forecasting atmospheric fronts, a convergence to the correct 
solution can be obtained by absorbing certain acceleration 
terms into the turbulence model. In the case of flow over 
large scale mountains, the difficulty is to confine the 
influence of the mountains to a shallow layer above its top 
when the stratification is strong. In hemispheric scale 
forecasting, absorption of extra terms into the turbulence 
model can lend to a substantial reduction in error through 
eliminating spurious solutions. However, some useful detail 
is lost at the same time. 


2. GOVERNING EQUATIONS 
2.1 Primitive Equations 


The normal form of equations used in atmospheric modelling 
are the equations of compressible gas flow with the hydrostatic 
approximation. For present Purposes we also make the 
Boussinesq approximation and treat the effect of the Earth's 
rotation as constant in space. The resulting system of 
equations, as used by Gent and McWilliams (1983), is also 
appropriate for the ocean: 


Zet ty =o (2.22) 
tie fu=0 (2.1.2) 
2- o (3 
3 +98/, =0 (2.1.4) 
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BO BT low 


ax * dy F 3z o (2.1.5) 


wagrat z O,H (2.1.6) 


The vertical coordinate Z is a function of pressure, and the 
lower boundary condition is simplified so that it is applied 
at a constant pressure surface near the ground, rather than 
the Earth's surface. 9% represents the height of a constant 
pressure surface, 0 is the potential temperature and f the 
Coriolis parameter. The rest of the notation is standard. 


2.2 The geostrophic momentum approximation 


A scale analysis appropriate to large-scale atmospheric flow 
suggests that the velocities in (2.1.1) to (2.1.6) can be 
approximated by their geostrophic values. 


u = = Yay (23252) 
v = 295 (2.2.2) 


In accordance with the prescription of Lighthill (1961), this 
approximation is only made in the advected quantities, not in 
the trajectories themselves. This allows uniform validity of 
the approximation for large times. Hoskins (1982) shows that 
the resulting system is still valid in the presence of 
discontinuities provided the scale parallel to the 


discontinuity is large. Therefore we replace Cs , Pv) in 


Dt ’ De 
(2.2.2) and (2.1.2) by °g ee 4 : 
Dt ’ Dt 


2.3 Lagrangian conservation form 


Equations (2.2.2) to (2.2.6) subject to the geostrophic 
momentum approximation can be written in the following 
Lagrangian form: 
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Ts- fu, (2.3.1) 
= piles fl £2,352) 
pe o (2.3.3) 
ae =O (2.3.4) 
where (M,N,0) = V(> + $ ENEA + y*)) (233.5) 


2 


and T is the specific volume. The natural boundary condition 
is that no fluid crosses the boundary of the integration 
domain. The work of Hoskins shows that this system of 
equations can generate a discontinuity in a finite time from 
smooth initial data. After this time we assume that the 
Lagrangian form remains valid while the Eulerian form does 
not. The correctness of this can only be tested by actual 
experiment. 


2.4 Implications for finite difference solutions 


If it is assumed that (2.3.1) to (2.3.5) hold for almost 
all the fluid volume, then a finite difference solution of the 
original equations (2.1.1) to (2.1.6) must be made consistent 
with these conservation laws. The requirement of conservation 
of quantities along trajectories cannot only be met in an 
Eulerian scheme by upwinding, however this violates the 
requirement that the volume of fluid with a particular value 
of a conserved quantity is also conserved. The latter 
requirement cannot be met exactly in a conventional scheme. 
The best practical procedure is to use a quadratic conservative 
scheme (Arakawa, 1966) and to use an artificial viscosity 
which acts as little as possible on the vertical component of 
the vorticity, essentially the geostrophic part of the flow. 


3. RESULTS 
3.1 Fronts 


Fig. 1 shows a solution of (2.3.1) to (2.3.5) obtained by 
representing the initial data as piecewise constant, in the 
manner of Glimm (1965) for hyperbolic conservation laws, 
integrating the ordinary differential equations for each 
segment, and fitting the results together in the way described 
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by Cullen and Purser (1984). This method is like a moving 
finite element method except that the volumes rather than the 
shapes of elements are specified. 


Fig. 1 Cross section of a front obtained by a Lagrangian 
method, contours of potential temperature. 


Fig. 2 shows a solution of the Eulerian equations (2.1.1) 
to (2.1.6) obtained by adding artificial viscosity of the form 


D(u-u,) 
z 2 
ne +k Vou (3.1.1) 


to equation (2.1.1), an equivalent term to equation (2.1.2) and 


xv70 to equation (2.1.3). This has the effect of enforcing 

the geostrophic momentum approximation on the scale of the 
discontinuity and allows a value of K to be used which is 100 
times smaller than that required if the extra acceleration term 
is not included in the viscosity. The solution does not 
capture the change in slope with height of the front very well. 
As the resolution is increased to 200 x 40, there is very slow 
convergence towards the solution shown in Fig. l. The 

adaptive method has a clear advantage here. 
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Fig. 2 Cross section of a front obtained by a finite 
difference method using a 200 x 20 grid, contours of 
potential temperature. 


3.2 Mountain flow 


Fig. 3 shows a cross section of the flow over the Alps 
predicted by a limited area forecast model which uses equations 
(2.1.1) to (2.1.6) and also includes the effect of moisture. 


The wind component across the Alps is about 15 mat, A large 
standing wave is generated, also transient waves. 


This structure is only observed in the atmosphere on much 
smaller horizontal scales. on the scale of the Alps, the 
constraint of the Earth's rotation makes the flow tend to go 
round, rather than over the mountain. Even when flow crosses 
the mountain, observations of clouds suggest that the rapid 
flow is confined to a thin layer near the ridge crest. 
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Fig. 3 18 hour forecast potential temperature cross section 
along 8E. 


Fig. 4 shows a two dimensional finite difference solution 
of the equations using a viscosity of the form (3.1.1). This 
suppresses the waves, but the disturbance due to the mountain 
extends up to the top of the atmosphere. 
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Fig. 4 Cross-mountain wind using finite difference model 
on 200 x 20 grid. 


A solution using an adaptive Lagrangian method by S. 
Chynoweth (1985) prevents this extensive disturbance which may 


Play a part in generating the spurious waves in the forecast 
model, 


3.3 Short range forecasts 


Figs. 5 and 6 show 24 hour surface Pressure forecasts made 
using the U.K. operational hemispheric model, in the first 
Case using (2.1.1) to (2.1.6) with artificial viscous terms 
added, and in the second case with acceleration terms 
absorbed into the viscosity as in equation (3.1.1). The 
verifying analysis is shown in Fig. 7. 


_ 


DISCONTINUOUS ATMOSPHERIC FLOW 629 


HK BER) 
AA PS P| 
A SSE 


R : “ Ss, or D; 
j Fe SSS ae ZAA, 
Re 
g 
Fig. 5 24 hour forecast using primitive equation model, 192 x 


‘ow ow or E 
48 x 15 grid. Solid lines-surface pressure (hPa), 
pecked lines-temperature at 850 hPa. 


EJ m 
Fig. 6 24 hour forecast using model as Fig. 5 with artificial 
viscosity (3.1.1). 
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cop 


Fig. 7 Surface pressure analysis for 12Z, 13 July, 1982. 


The forecast with the acceleration absorbed into the 
artificial viscosity is much smoother, despite a 10-fold 
reduction in the coefficient. This forecast is more accurate 
for most of the pressure systems except a deepening depression 
near the North Pole and another in the Pacific. Some strength 
has been lost in these. The improvement reflects the removal 
of much spurious noise, though it has been achieved at the cost 
of losing some correct detail from the forecast shown in Fig. 5. 


The Ly error over the region 70°w to 20°, 30° to 70°N has been 
reduced from 3.2 to 2.4 hPa. 
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4. CONCLUSIONS 


This brief review has shown that numerical procedures for 
solving the governing equations of atmospheric motion may need 
careful design to ensure that spurious solutions are not 
obtained. Turbulence models may have to be designed using 
knowledge of the structure of approximate systems of equations 
as well as the complete system. Adaptive methods can achieve 
far greater accuracy in situations where they are practicable. 
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THE IMPLEMENTATION OF MOVING POINT 
METHODS FOR CONVECTION-DIFFUSION EQUATIONS 


C.L. Farmer and R.A. Norman 
(Oil Recovery Projects Division, AEE Winfrith, Dorset) 


l. INTRODUCTION 


Moving point methods are able to solve high Peclet number, 
multi-dimensional, convection-diffusion equations with very 
small levels of numerical diffusion, overshoot or phase error 
and without excessive mesh refinement. The method uses a fixed 
mesh for the approximation of diffusion processes and a set of 
moving points for the approximation of convection. 


We classify moving point methods into either pure or hybrid 
methods. Pure methods make minimal use of the fixed mesh and 
include the original moving point method of Garder, Peaceman 
and Pozzi (1964) which fails to converge under mesh refinement 
(Price, Cavendish and Varga, 1968). Many pure methods are 
inconsistent but can be modified by appropriate treatment of the 
diffusion terms (Farmer, 1985a). Unfortunately these convergent 
pure methods are too complicated for practical use. 


Hybrid methods make maximal use of the fixed mesh. 
Convergence of these methods is trivial because at sufficiently 
low mesh Peclet numbers the methods are designed to become 
identical to an underlying, convergent, fixed mesh method. 
Relative to pure methods, the hybrid methods are straightforward 
to implement and possess the advantage that local, adaptive 
management of the set of moving points may be accomplished. 

This is done by the insertion or deletion of moving points 
according to the local gradient of the solution. 


A detailed description of the hybrid moving point method may 
be found in (Farmer, 1985b). 
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2. EQUATIONS TO BE SOLVED 


Throughout the following we shall consider various functions 
of a spatial point x in a two-dimensional Euclidean space, E.. 


2 
The components of x will be denoted by Xj (i = 1,2). Let 2 be 
a finite, singly connected, subset of E, with boundary 
d = IN UIN,- The symbol t denotes time. Let u = u(x,t) be a 
continuous time-dependent solenoidal vector field in Ent called 
the velocity. 


We wish to find a scalar field c = c(x,t), called the 
concentration, satisfying the conditions; 


ae + u.Ve = V.D.Vc xe2,t > O (2.1) 
c(x,0) = co (x) xe (2.2) 
e(x,t) =1 xeon, ,t 20 Cain) 

ac 
owas o xe dQ, t 20 (2.4) 


where D is a positive second rank tensor called the diffusion 
tensor. S5 is a functicn defining the initial condition and 


the symbol 0/dn denotes the outward pointing normal spatial 
derivative. We assume that on an, (R) characteristics enter 


(do not enter) the region 2. 
3. THE MOVING POINT METHOD 


3.l Definitions 

Let N be a rectangular region which is partitioned into a 
union of M identical rectangular mesh cells, Pa (m= l, ..., M). 
Let x” denote the coordinates of the centre of a Introduce 
the discrete time ie = NT, T 20, n=0,1,2,... where T is the 
time step. Introduce a mesh function with values on which is 


interpreted as an approximation to the concentration at the 


point x" ata time ti 


Bnd! 
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Introduce a set of moving points {P, : k = 1,2,...,N} with 
distinct coordinates x, in E, at time t . Let a be the 
2 n 


concentration on moving point Pk at time to which Pa 

interpreted as an approximation to o(xprt,) + Let Vs be the 
total number of moving points with co-ordinates in a, at th: 
If the point Pk is nearer to x” than any other moving point 


then we write xe Qo 


3.2 Fixed Mesh Interpolation 


Let Te denote the fixed mesh interpolator which from the 


fixed mesh values Fo constructs a function ca according to 
G(x) = 12 [FoX] Xe (3.1) 


where the index £ ranges over the indices of the fixed mesh 
points. s is the order of the interpolator. Examples of such 
interpolators are piecewise constant interpolation (s = 1) and 
piecewise bi-linear interpolation (s = 2). 


3.3 Fixed Mesh Algorithm 


n n r 
TE xe Ak and cj are known at bo then the hybrid moving 

point algorithm requires the construction of a first 
approximation oa to at We will use the finite difference 
version of the modified method of characteristics (Huffenus and 
Khaletzky, 1981; Douglas and Russell, 1982; Hartree, 1953). 
Convergence of the algorithm has been proved by Douglas and 
Russell (1982). 


We will denote the construction of qt by the fixed mesh 
algorithm using the notation 
antl n 
Ce er a bc (3.2 
Fahy ES) ) 


3.4 Moving Point Interpolation 


Let ty denote the moving point interpolator which from 


moving point function values Gy and fixed mesh values FL 


n 
constructs a function Q according to 


n Trn n n 
Q(x) =I ley Fi x] XEN (3.3) 
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where the indices k and i range over the indices of the moving 
and fixed mesh points respectively and s is the order of the 
interpolator. A variety of interpolation methods are possible 
but in problems in which the concentration possesses some 
smoothness the following algorithm is appropriate. This is 
based upon a Taylor expansion about the nearest moving point to 
a cell centre using spatial derivatives evaluated on the fixed 
mesh, 


n n n =, n 
Gia (hae x) Viel For Xl Xen r KER, vi > O 
OF (x)= (3.4) 
2 n D e 
Ip [F x] XeQ_, Ya = 0 


where s 2 3. In most applications we use s = 3 but in very 
smooth problems we use s = 4. 


In problems in which the concentration is not smooth 
relative to the fixed mesh then the use of the nearest grid 
point interpolator (NGP interpolation) is appropriate; 


gy XEN , ken , v2 >o 
m m m 
g(x) = (3.5) 
2 n 
1 [Fy x] v= 0 


In our codes we combine these interpolators according to the 
magnitude of the local concentration gradient such that if it 
exceeds a user defined value we use NGP interpolation and 
otherwise the Taylor method. 


3.5 Point Moving Algorithm 


n+. 
To obtain Xk 1 from x we solve the equation 


dz 
=a ulen) (3.6) 
with the initial conditions z(0) = x, (k =1, ..., N) using T 


steps of the classical 4-th order explicit Runge-Kutta method 
with time step T/T. T is an integer which may vary during 
the course of a calculation. 


MOVING POINT METHODS FOR CONVECTION-DIFFUSION 639 


We denote the operation of moving the points by the symbol 
E and we write 


ae =E [X wet), 1/1] (3.7) 


3.6 Local Insertion-Deletion of Moving Points 


In many problems the concentration is almost constant, with 
values of nearly zero or unity, in a large fraction of 2 and 
possesses sharp fronts separating the regions of different 
values. One only requires moving points in the regions occupied 
by, and near to, sharp fronts. It is thus useful to implement 
some strategy for the insertion or deletion of moving points in 
R. 


The operation of insertion and deletion will be denoted by 
the symbol J and we write 


x =J [x] (3.8) 


A description of an insertion-deletion algorithm may be found 
in (Farmer, 1985b). 


3.7 The Hybrid Moving Point Algorithm 


Using the initial condition we assign (os for each fixed 
mesh cell, x and ap for each moving point. The x are chosen 


so that there are, say, 10 points per cell near to regions of 
large spatial gradient and, say, 1 point per cell away from 
such fronts. The points are most conveniently placed in the 
cells using a uniform random number generator. 


j n n n+l 
At time ps suppose Cu x and Ak to be known. The Ci ' 


+l 
x and i are constructed as follows. 
Step 1 - Update the moving point positions to obtain o by 
+1 n 
x =E [X wlxt)), 1/1] (3.9) 
choosing T so that over any time interval, T/T, no moving point 
travels a distance of, say, more than 1/3rd (or other user 


specified value) of a mesh cell in any direction. 


Step 2 - Perform the insertion-deletion procedure 
cab +l 
x = J (x, ] (3.10) 
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Step 3 - Update the fixed mesh concentrations using the fixed 
mesh algorithm to obtain the first approximation 


Ga Sp n 
a = otc] (3.11) 


Step 4 - Calculate the changes, scr, due to diffusion by 


6c™ = tA. (DA)C™ (3.12) 
m m 


where A. (DA) is the central difference approximation to V. (DV). 
We must test that t is small enough to satisfy the usual 
stability condition for an explicit central difference method 


for a diffusion equation. ech can be obtained as part of Step3. 


Step 5 - Update the moving point concentrations by setting, for 
each k, 


Ci a SERASA s 2 n +1, 
NEES oe Sure TAN x ] (3.13) 
Step 6 - Construct the fixed mesh concentration es cl by the 
interpolation 
nae s p ntl —*ntl n+l 
ae n ee Ae ae a a aN 
where 


antl ~n+l 
Ga aie S 22m 
(3.14) 


Pees (hors 


and 2 and < in (3.14) denotes an ordering relation on the mesh 
cell indices. À is a weighting factor, dependent upon the 
lowest upper bound, Pe,, Of the mesh Peclet number. The 


function à is chosen via numerical experiments in such a way 
that the hybrid moving point method is trivially convergent. A 
suitable ì is 


(3.15) 


(i 
fo} 
wW 
o 
A 
6 

Li 


Taio 


us 
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4, REMARKS CONCERNING IMPLEMENTATION 


In a two-dimensional problem we require three arrays, X, Y 
and A, storing the x- and y- coordinates and concentrations on 
the moving points. It is useful to scale X and Y so that the 
integer parts of the elements of X and Y give the indices of 
the mesh cells in some suitable ordering. 


In our codes we utilise 'buffer' cells all around the 
physical cells with zero velocities on their outward facing 
boundaries. This is so that when moving points leave the 
physical region they may still be referenced in the inner loop 
executing the Runge-Kutta method for moving the points. 


In the insertion-deletion step we test each point and delete 
it from the data structure if 


(i) It is outside the physical region. 


(ii) There are more than the maximum number of moving points 
in the fixed mesh cell. 


A convenient method of deletion is to decrement by one the 
index of all the remaining moving points. 


Finally, we would assign a moving point to the centre of 
each fixed mesh cell which contains fewer than a required 
minimum number of moving points. The concentration assigned 
to the moving point is that of the cell centre. 


As work space we require two arrays of the same size as the 
fixed mesh concentration array to store the index of the moving 
point nearest to the centre of each mesh cell and the total 
number of moving points in each cell. 


5. NUMERICAL TESTS 

In this section the components of x will be denoted by (x,y) 
and the components of u by (u,v). The diffusion tensor is 
assumed diagonal and constant with common element, D. 
5.1 One-Dimensional Test 

A solution of the constant coefficient convection-diffusion 


equation in an infinite interval is 


c_(x,t) =4- 4 erf (Z—Bt (5.1) 
E 2 7dt 
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If we choose constants u, D and ew appropriately then the 


function 
c(x,t) = c (x, ttt.) (52) 


is, for small enough t, a good approximation to an exact 
solution on a finite interval. The solution (5.1) has the form 
of a smoothed step function with values between O and 1. 


In Table 1 we give numerical results which investigate the 
effect of mesh refinement. At the initial time we have placed 
four moving points in each mesh cell using a uniform random 
number generator. We have used a second order Taylor moving 
point interpolator. These results were obtained using 
t, = 0.25, u = 1:0 and D = 5.10 4, The width, w, is the number 
of mesh cells in which the mesh cell values are in the interval 
[.1, .9] at t = 0. The width is a measure of how smooth the 
front is. 


A more detailed investigation of convergence behaviour may 
be found in (Farmer, (1985b). 


Distance Maximum absolute 
Pe. Width Convected error as percentage 
(cells) (cells) of unity 
100 1 lo 9.174 
50 2 20 4.100 
25 4 40 1.004 
12.5 6 80 0.1947 
Table 1: 


Maximum Absolute Error as a Function of Mesh Peclet Number 


5.2 Two-Dimensional Test 


To test the performance of the hybrid moving point algorithm 
on an example with curved characteristics we study the standard 
rotating cone problem with (u,v) = (-y,x). We used a 43 x 43 
mesh (41 x 41 physical mesh) of square cells of unit side. 
Reflection boundary conditions were used and the origin was 
placed at the centre of the mesh. At the initial time a single 
moving point was placed at the centre of each mesh cell. 

The insertion-deletion procedure ensured that at the end of 
each time step there was at least one moving point in each mesh 
cell. 100 time steps were used for a complete revolution. The 
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Runge-Kutta method computed the position of the moving points 


with an absolute error in each component of less than 10> of 

a mesh length. As the initial condition we used the function 
2 2 

aR y] (5.3) 


c 
o 


where a is a parameter controlling the width of the front. 


In Table 2 we show the maximum absolute error during the 
rotation and the maximum absolute error at the end of a 
complete revolution. A second order Taylor moving point 
interpolator was used except for the first result where the 
NGP method was used. In Table 2 the width parameter, w, is the 
number of cells across a diameter of the cone parallel to the 
x-axis on which the mesh cell centre concentration is greater 
than 0.1 at t = O. The increase in error for the smoothest 
cone is a result of the increasing influence of the approximate 
reflection boundary conditions. 


Maximum Absolute Error 


a Width as percentage of unity 
(cells) 
During revolution At completion 
1/15 1 97.34 0.0 (NGP interpolation) 
X 3 41.30 3.391 1074 
5 7 14.02 2.318 104 
15 1l 6.492 1.934 qos" 
Table 2: 


Maximum Absolute Errors in the Rotating Cone Problem 


6. CONCLUDING REMARKS 


A hybrid moving point method has been described and some 
features of computer implementation discussed. The method has 
been shown to be very accurate in both ohe- and two-dimensional 
problems. 


The structure of the method is such that the convection- 
diffusion problem is essentially reduced to a problem of 
interpolation on scattered data sets. This interpolation is 
the main source of error and so in any practical problem the 
expected numerical error can be estimated from estimates of the 
solution gradient and the order of the moving point 
interpolator. 
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The hybrid algorithm is suitable for application to practical 
problems (Farmer, 1985b) and we are currently using it in the 
two-dimensional simulation of flow in oil reservoirs. 
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THREE DIMENSIONAL FREE LAGRANGIAN HYDRODYNAMICS 


H.E. Trease 
Computational Physics Group (X-7), 
Los Alamos National Laboratory, New Mexico) 


1. INTRODUCTION 


The purpose of the following discussion is to describe the 
development of a 3-D Lagrangian hydrodynamics algorithm. 
The 3-D algorithm is an outgrowth of the 2-D free Lagrange 
model that is fully described in Trease (1981). Only the more 
pertinent issues of the free Lagrange algorithm will be 
presented; the details of the rest of the code development 
project are interesting but not appropriate in the context of 
a free Lagrange discussion. Let it suffice to say that a 
complete production code is being developed to support the 
free Lagrange algorithm to be described. A graphic description 
that outlines this code development project is presented in 
Figure 1. 


The main objective of this project is to develop a computer 
model that can be used to simulate fluid flow in three 
dimensions. The inspiration for using free Lagrange as a basis 
for a hydrodynamics code was gained from the work of Crowley 
(1971), Fritts and Boris (19/9), and Kirkpatrick (1976). The 
2-D code, described in Trease (1981), was based on this 
previous work. This two dimensional model showed several 
attractive features about free Lagrange. First, it showed that 
free Lagrange can be used to handle fluid flow problems that 
exhibit strong shearing forces which classically could only be 
handled by Eulerian type algorithms. 


Second, the accuracy of a free Lagrange algorithm was 
shown to be sufficient, with the irregular mesh, to produce 
credible solutions. Third, due to arbitrary connectivity of 
the free Lagrangian logic, meshes can be variably zoned. This 
allows the user to put the resolution where it is needed. 
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All three of these features of free Lagrange have been 
exploited to extend the 2-D model to include the third 
dimension. In doing so many aspects of large scale code 
development have been investigated. Several modern software 
tools have been used to make the manipulation of the arbitrary 
connectivity matrix, that is associated with free Lagrange, 
easier. For reference these tools include; a dynamic (heap) 
memory manager, a storage block manager and also a relational 
data base manager. 


CONTROLLER 
LEVEL 


PHYSICS 
CONTROLLER 


RADIATION 
DIFFUSION 


CONTROLLEE 
LEVEL 


DISTRIBUTED 
PROCESSING 
LEVEL 


MULTITASKING 
LEVEL 


Fig. 1 Block structure of the 3-D code, illustrating the four 
levels that make up the code. 


2. DESCRIPTION OF THE FREE LAGRANGE ALGORITHM 


The main feature of the free Lagrange algorithm that 
identifies it from a standard Lagrange algorithm is the 
connectivity matrix that both defines the nearest neighbours 
for each point and the shape of the computational cells over 
which the fluid equations are integrated. A construction 
technique, that creates a VORONOI mesh, is used to identify 
nearest neighbours and define the mesh cells. The Voronoi mesh 
that is constructed has several properties that make it an 
excellent choice for maintaining the connectivity matrix for 
the 3-D code. ‘hese are: 


A) The set of resulting polyhedra map the space defined by the 
mass points and bounding surfaces uniquely, i.e., none of the 


| 
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polyhedra overlap and nearest neighbours are guaranteed to be 
reciprocal. 


B) Each polyhedron remains convex. This is accomplished by 
changing the area and the number of faces, i.e., neighbour 
swapping. 


C) The volume and surface area of each polyhedron changes 
continuously. These and other aspects of the Voronoi mesh will 
be discussed more thoroughly in later sections. 


One of the more important goals of the 3-D code is to be 
able to couple various hydrodynamic algorithms together. This 
means that free Lagrangian hydro will be used in regions where 
the flow field is most distorted. Then, in the (more) 
well behaved regions we will use an adaptive rezoning technique, 
with a mesh composed of mass points that have a fixed 
connectivity. These two algorithms will then be coupled 
through a third algorithm called a ZOC. The free Lagrange and 
the ZOC algorithms will now be described in detail. The 
detailed hydrodynamic equations, that are solved by these 
algorithms, along with their finite difference representations 
are discussed in detail in Appendix A. 


The basic features that describe the free Lagrange algorithm 
are listed below: 


A) All mesh quantities are cell centred. 


B) The computational domain is described by an arbitrary 
distribution of mass points. 


C) The code automatically constructs its connectivity matrix. 
D) Mass points can be merged and/or added to the mesh. 


The code determines its connectivity matrix by constructing a 
unique polyhedron about each mass point. The resulting 
polyhedral mesh is known as a Voronoi mesh. The faces of the 
polyhedron determine the set of "nearest" neighbours with 
which a mass point interacts. The faces of the 

polyhedron are represented by intersecting perpendicular 
bisecting planes between a given mass point and each of its 
neighbours. The details of this construction process are 
described more fully in Appendix B. Figure 2 shows several 
examples of Voronoi cells. The set of polyhedra that describe 
the mesh completely and uniquely span the space over which the 
mass points are distributed. Figure 3 shows a 2-dimensional 
projection of a 3-dimensional mesh, where the arbitrary 
polyhedra reduce to arbitrary polygons. 
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Fig. 2 Examples of several Voronoi cells. The Voronoi cells 
are the polyhedral shaped objects. The straight lines 
that end at a point represent the connections between 
the central mass point and its "nearest" neighbours. 


Due to the fact that all physical quantities are carried at 
cell centred mass points, each point can change the set of 
"nearest" neighbours that it associates with by changing the 
shape of the polyhedron surrounding it while still retaining 
its Lagrangian definition. The neighbour changing process is 
smooth and continuous because of the integral nature of the 
algorithm. Two points become neighbours when a face with 
"epsilon" surface area appears between them. These points will 
drop each other as "nearest" neighbours when (and if) this face 
area shrinks below "epsilon". 


There are several advantages and disadvantages associated 
with free Lagrange hydro. The advantages are obvious to 
anyone doing hydrodynamic calculations. Due to the arbitrary 
connectivity of the mesh and the ability to change this 
connectivity, highly distorted flows can be modelled by using a 
Lagrangian algorithm. Also, since the mesh maintains itself, 
no manual rezoning is needed (this is extremely important in a 
3-dimensional code). Complex geometries that require variable 
zoning can be set. up relatively easily since the code calculates 
the connectivity matrix from an arbitrary distribution of mass 
points. The main disadvantage of this method is the overhead 
associated with maintaining and processing the connectivity 
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lists, but since the neighbour lists are unique, they are very 
amenable to a calculation using a multi-tasking algorithm 
(i.e., the neighbour searches can be done in any order, but 

the resulting global connectivity matrix is the same). Also, 
future machines that support hardware gather-scatter operations 
will improve the efficiency of this algorithm. 


EH | 
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Fig. 3  2-dimensional projection of a 3-dimensional Voronoi 
mesh. 


One of the weakest aspects of the free Lagrange method 
described is the treatment of continuous interfaces. This 
results from the fact that the edges of the computational 
volumes are arbitrarily defined to be midway between two 
"nearest" neighbours. This definition, while consistent with 
the Lagrangian equations, leads to a poor definition of a 
continuous interface. The realization of this fact suggested 
that an interface tracking algorithm was needed to follow the 
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motion of interfaces. We will now describe the algorithm that 
is being used and how we intend to develop it into an all 
encompassing interface tracking algorithm, along with some 

of the positive side effects, in relation to distributed 
processing and slip-line treatment. 


A little reflection on two key properties of interfaces will 
help the reader's understanding of the algorithm to be 
described. First, a continuous interface separates what could 
be considered immiscible fluids, i.e., fluid "A" remains 
distinct from fluid "B" even though interpenetration may occur. 
Second, an interface can be described in a space that is one 
dimension less than the rest of the problem. In one dimension 
an interface is a point, in two dimensions it is a line, and in 
three dimensions it is a surface. Generalizing this idea we 
can represent an interface as a (N-1) construct in a 
N-dimensional space. Taking these two concepts into 
consideration we derived an interface tracking construct called 
a ZOC. An example of a ZOC is shown in Fig. 4, where several 
observations can be made. First, we can see that the interface 
separating the two fluids is distinct. Also, we notice that 
zoning away from the interface in the two regions is 
discontinuous with respect to the other region and the 
interface. 


Most of the technical details of maintaining a ZOC will not 
be discussed, but some of the more general aspects of this 
surface tracking concept may be interesting. These are listed 
below: 


A) A ZOC is essentially a special free Lagrange region that 
uses its connectivity matrix to connect to the surrounding 
regions. 


B) Points can be added or subtracted from a ZOC to maintain 
the interface. This process is made especially easy since a 
connectivity matrix is used to connect points and thus the mesh 
reconnections account for the fact that a point has been added 
or subtracted. The process of adding new points is trivial 
because the Voronoi mesh indicates when a new point should be 
added and where the new point should be located. The rest of 
the detail of adding a point involves the redistribution of 
mass, momentum, and energy in a local region of space. 


C) A ZOC will work in two dimensions, to maintain a line 
interface, just as well as in three dimensions. 


D) The treatment of slip-lines should be automatic with a ZOC 
since there is no restriction on the tangential velocity of 
the fluid on either side of the interface. 
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The lower figure shows 


REGION 2 


FS 
The upper figure shows the ZOC in 


V 


FREE 
SURFACE 

Y 

Y 


3-D FREE LAGRANGIAN HYDRODYNAMICS 
relation to the free surface. 


the corresponding grid. 


Fig. 4 Example of ZOC. 
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E) This interface will be used to connect regions that use 
different hydro algorithms. This means that a free Lagrange 

algorithm can be used on one side of the interface and a fixed 

mesh algorithm on the other side. 


F) A ZOC makes a natural communication buffer for connecting 
two separate algorithms that are running as distributed 
processes. This is where the dimensionality of the ZOC 
becomes important because the data transfer between processes 
must be kept to a minimum. The separate region processes are 
N-dimensional data structures and a ZOC is a (N-l) data 
structure, which means the amount of information being 
communicated between the regions is small compared to the 
regions themselves, 


APPENDIX A 


THE HYDRODYNAMIC EQUATIONS AND THEIR FINITE DIFFERENCE 
REPRESENTATION 


The hydrodynamic equations that are solved on this free 
Lagrange system of mass points are given below. These | 


equations represent the conservation of mass, momentum, and 
specific internal energy, respectively; 


Continuity Equation; 


1 Sing 
=O, Ta, (A.1) 


p= =-Vp - [Vy], (A. 2) 


Conservation of Internal Energy; 


De aim = 
O Sent -p(V.u) - (Y:Vu) , (A. 3) 

where j 
p = fluid density, | 
p = fluid pressure, | 
€ = internal energy/unit mass, | 
u = fluid velocity vector, 
Y = total stress tensor, and | 


n 


T+Ọ, 
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where 
T = stress tensor and 
Q = artificial viscosity tensor 
ae ar io 
2% div(u) { (Vu) - 3 div(uje} , if div (u) <o 
Q= (A.4) 
o , if div (u) > 0, 
and where 
div(u) = divergence of velocity vector u, 
Va = dyadic product of the differential operator V and 


the velocity vector u , 


oll 
" 


unit tensor , and 
& = constant * local grid spacing. 


The algorithm used to solve these equations can be outlined as 
follows. First, Equations l, 2 and 3 are integrated over an 
arbitrary volume (in reference to the code we integrate over a 
computational cell). The volume integrals are transformed to 
surface integrals by using the divergence theorem. The mean- 
value theorem from calculus is used to obtain average 
quantities. These resulting equations are then cast into 
finite difference form as shown below. The following notation 
will be used in writing the finite difference representation 
of the fluid flow equations; 


i = mass point i, spatial position at (XI, YI, ZI), 


j= gee nearest neighbour of mass point i, spatial position 
at (XJ, YJ, ZJ), 


J = total number of nearest neighbours associated with mass 
point i, 
n = present time step (time = t), 
n+l = next time step (time = t + At), 


= distance from mass point i to nearest neighbour j, 


i,j 

j,j 7 area of polygon face separating mass point i from 
‘J nearest neighbour j, 

vi j = volume of the polyhedron associated with mass point i and 
, 


nearest neighbour j, 
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M, = mass of fluid associated with mass point i, 


P, . = fluid pressure at the face associated with mass point i 
and nearest neighbour j, 


, = fluid density in cell i, 


U, . = fluid velocity at the face associated with mass point 
t;J i and nearest neighbour j, and 
Ay , = normal vector to the face associated with mass point i 
‘J and nearest neighbour j. 
+1 Dp ne “ntl 
A Sahil E Pik WS en te eta (A.5) 
LIEDE $ EEEE e Pis > e ER Gia 
jal 
DU s, is = 
M, (— Aa PAL Bese buy cli A (A.6) 
ibe a goa tH 4 4 ad a aed * 
J = J = 
De is +1 2 = 
Hit EEE N OA ENA UR OAR eon env ALTO], A. . 
i De a ae er ae E E ner 
— J a = 
apd aes IB ANG gee EER aN 4 ty (A.7) 
30 je, 5 i,j i,j 
APPENDIX B 


MESH CONNECTIVITY (NEAREST NEIGHBOUR CALCULATIONS) 


The purpose of this Appendix is to describe the manner in 
which the connectivity matrix for the free Lagrange algorithm 
is calculated. The connectivity matrix contains the "nearest" 
neighbours for all the mass points. These connections are used 
for calculating surface areas and volumes of the Voronoi cells 
that make up the computation mesh. In addition to describing 
the geometry of the cells, the connectivity matrix indicates 
which cells will interact hydrodynamically. 


Each Voronoi cell is made of an arbitrary number of 
intersecting planes. These planes construct a convex 
polyhedron with an arbitrary number of "faces" about each mass 
point. Each face forms a polygon with an arbitrary number of 
"edges". The trick is to devise an algorithm that can 
calculate the connectivity matrix from an arbitrary 
distribution of points. As we proceed through this discussion 
the two following definitions should be kept in mind. 
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A) "face" neighbour: Any two points that are "nearest" 
neighbours are separated, in 3 dimensions, by a polygon shaped 


"face". A "face" neighbour therefore refers to the "nearest" 
neighbour that is across a given "face" from a given central 
point "I". There is a one-to-one correspondence between the 


number of "face" neighbours that are associated with a point 
and the number of "faces" that make up the polyhedral cell 
surrounding that point. 


B) "edge" neighbour: Each of the "faces" of a Voronoi cell 
is a polygon. Each vertex of a given polygon "face" is found 
to be the centre of a sphere that passes through four points. 
The four points are: the central point "I", its "face" 
neighbour "j", and two other points that are called "edge" 
neighbours (they are referred to as "k-1" and "k"). Also, 
since each polyhedron is a closed figure each "edge" neighbour 
must also be a "face" neighbour. It should be noted that there 
is a one-to-one correspondence between the number of "edges" 
on a given polygonal "face" and the number of "edge" neighbour 
associated with that "face". 


The main idea in discovering the "nearest" neighbours of a 
point is to identify the list of "edge" neighbours for each 
"face" of the polyhedron surrounding that point. As each set 
of "edge" neighbours is discovered they are put on a stack of 
"face" neighbours, then the next "face" neighbour to be looked 
at is pulled from this stack. When the stack of "face" 
neighbours is empty the Voronoi polyhedron is complete and the 
connectivity for this given point has been found. 


The process of constructing the connectivity matrix for a 
given point is described more completely in the following 


steps: 


A) Assemble a list of "possible" neighbours. This process 
depends on whether or not the point has been calculated before. 


a) If the point hasn't been calculated before: 


1) use any "logical" neighbour information to select as 
many potential neighbours as possible. 


2) use a proximity rule to select a set of closest points 
based on filters such as distance between and material 
type. 

b) If the point has been calculated before: 


1) recall the old connectivity matrix for a given point. 
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2) form the list of possible "nearest" neighbours from the 
old "nearest" neighbours plus the "nearest" neighbours 
of the old "nearest" neighbours. 


B) From this list of "possible" neighbours we identify the 
1n "face" neighbour by the procedure outlined below. 
a) First all points are translated so the coordinate axes 


are centred at the central mass point, point "I", by 
setting 


> > > 
x) = Xa - x, (2=1,...,nkin) (A.8 


where 

> 

x = coordinates of a possible neighbour "2", 
$ 

x, = coordinates of point "I", and 


nkin= number of points on the list of "possible" neighbours 
for point "I", 


b) Select the first "face" neighbour by defining 
following equation, 


N) = Index of [min |X, H (2=1,...,nkin) (A.9) 
LA 
where 


EA = the distance to point 2 and 


Ny = index of the PE "nearest" neighbour (this is the index 
into the global mesh arrays). 


c) Select the first "edge" neighbour for the first "face" 
by defining 


E) 1 = Index of [min {w | (Š), x Š), | + 
fea 
' (eee NABER ES (A.10) 


> > 
+ Wy cos (OK) 1+ Šo) + Wy x, 131 (2=1,...,nkin) 
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where 
Ena N 
Eh aegis th 
Ek j = the k edge" neighbour of the j "face" neighbour, 
, 


COR the coordinates of the "face" neighbour "N.", 


W, = weighting parameters f > Wy > w3); 


(È ),= the coordinates of the PR Voronoi point. This point is 


found as the centre of a circle which passes through 
three points. These points are: the central point "I", 
the "face" neighbour Wie and the next possible 


neighbour "2" (Ny 7 Ny). The equations, in matrix form, 


that determine the (X,Y,Z) - coordinates of the get 
Voronoi point are: 


ig 


es > 2 
Aj KESH 


> > + 12 

X, 3 (%) 9 f= nek |x, | P 
> 

X? x Xy o 


NOTE: This step describes the "nearest" neighbour algorithm 
in 2 dimensions (Z=0). Here we just use it to "start" the 
3-dimensional algorithm described below. 


da) The process of selecting the rest of the "nearest" 
neighbours for point "I" serves a dual purpose. First, we 
finish calculating the rest of the "face" neighbours and we 
also discover the list of "edge" neighbours that make up the 
polygonal "face" that separates points "I" and "j" An 
important point to notice is that the list of "nearest" 
neighbours for point "I" are contained within the sets of 
"edge" neighbours, i.e., the "nearest" neighbours are a subset 
of the "edge" neighbours. Therefore, by sifting the "edge" 
neighbours we obtain a list of unique points that represent the 
"face" neighbours for point "I". The "edge" and "face" 
neighbour lists will bind each other to completely describe the 
polyhedral cell surrounding point "I". The 3-D "nearest" 
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neighbour selection algorithm is described below. 


1) We already know the first "face" neighbour, Nyy and the 


first "edge" neighbour, , for face 1 for point "I". 


E 
1,1 
These were found in Steps (B.b) and (B.c). 


2) Now we calculate the "edge" neighbours for "face" j- 


x > > > 

E, j 7 Index of laln {w, (San 2 (Le) Š a(x] 
> > > > > > 

eI OR are 24) 51] +W, cos ((X jr Xo) +W cos (X 1 + žo 


+ Wy [X)[}] (2=1,...,nkin) (A.11) 


JEA Lj 


j = index of the current face, 
k=2t = = " n 
o K (until E PLI where K=number of "edge" 
neighbours), 
WwW = 


= weighting parameters > > 
i "wi W Ws >W 4 


e 
4) 5= coordinates of nearest j, 


me $ th 
(X) o coordinates of the 2~° Voronoi point and 
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2 
> > 
BH, ESN 
> > 3 $ 2 
Xecuter. MS lel : 
> re 
Xg 1%, 


3) From the list of "edge" neighbours, (Ey, k=1, K), we add 
the unique indices of the list of "face" me tgus (Nii j=l, J). 


4) Increment j and continue with Step (d.2). This continues 
until all the "face" neighbours have been calculated, i.e., 

J > j. J is the number of "face" neighbours that are 
associated with point "I". 
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PARAMETRIZATION OF VISCOSITY IN THREE DIMENSIONAL 
VORTEX METHODS AND FINITE - DIFFERENCE MODELS 


S.P. Ballard 
(Meteorological Office, Bracknell, U.K.) 


1. INTRODUCTION 


Three dimensional vortex methods, such as described by 
Leonard (1980) and Chorin (1982), provide a description of 
inviscid, incompressible fluid flow in terms of the evolution 
of vortex filaments or line vortices with small but finite 
cores. 


This paper will discuss the results of inviscid simulations 
that identify the close alignment of opposite signed vorticity 
as important in allowing rapid stretching of vorticity whilst 
conserving energy. This suggests a method of parametrizing 
the effects of viscosity on the small scales of motion 
cancelling these sections of the filaments and relinking the 
remainder. 


Results of simulations of the interactions of vortex rings, 
with the above parametrization of viscosity included, are 
compared with the observations of Fohl and Turner (1975). 


The simulations are being used in order to evaluate 
turbulence models used in finite difference integrations of 
three dimensional flow. The implications are discussed. 


2. CHORIN'S THREE DIMENSIONAL VORTEX METHOD 


The Euler equations for three dimensional, incompressible, 
inviscid flow in vorticity formulation are 


Do ow > > > > 
O Spd, a 
D DE + (u.V)w (w.V)u (2.1) 
where 
o=Vxu, V.u=0 (2.2) 
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> > as 
with w the vorticity vector, u the velocity and V the 
differentiation vector. 


The solution of equations 2.2 is the Biot-Savart Law for the 
velocity field, 


>o 1 [Gx x wee) axe 
u(x,t) =- 5 f x F FOA (2.3) 
x - x' 


The vorticity can change with time, from equation 2.1, due to 
the advection and stretching of vorticity. 


The vortex method used here is based on that described by 
Chorjn (1982). It is assumed that the vorticity field can be 
described in terms of a sum of M vortex filaments of small but 
finite cross-section, o, with equal and constant circulations, 
T. The vortex filaments are further approximated by Ny 


straight line segments of length as; , i=1,M, j=1,Ni. The 


motions of the end points of the segments, nodes, are followed 
so that vortex stretching is represented by the increase in 
length of the line segments. The maximum length of the 
segments is limited to some value, So: Once this value is 


exceeded the segment is split and an extra node is followed at 
its midpoint. Therefore the number of nodes followed increases 
as the length of the filaments, and hence vorticity, increases. 


The velocities are calculated using a smoothed and 
discretized form of the Biot-Savart Law. 


M Nii i eed Fedele HAN Ne LE 
Tede <a i z T pi (x-x, )x y (x-x, )As, ds, (2.4) 
i= 


1 j=1 Sin 
|x-x, 
A J 
> k 
where ii is the unit vector directed along the tangent to the 
+ 4 A >i 
jth segment of the ith filament and x," is the midpoint of the 
>> i 
segment. Aa ) is a smoothing function of width o required 
to remove the singularity of the integral whilst distorting 
close vortex interactions. 


3. INVISCID SIMULATION TO STUDY VORTEX STRETCHING 


3.l Initial data 


A simple situation in which to study vortex stretching is a 
periodic unit cube containing two perpendicular rectilinear 
filaments separated by distance 0.1 in the mutually 
perpendicular direction. In this situation there is initial, 
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mutually induced, strain along both filaments in the direction 
of their vorticity vectors producing stretching. 


A constant value of g = 0.05 was used and 


v(a) = o Jal>o.495 
= a<|a|<o.495 (3.1.1) 
= J|? [2l<o 
oo 


similar to the form used by Chorin (1982). Image contributions 
within distance 0.495 were included in the velocity integral 
and a fourth order Runge-Kutta time integration scheme was used. 


3.2 Results 


As the flow evolved the filaments stretched and became 
increasingly chaotic and tangled. By the end of the 


integration some segments had stretched by a factor of lo; 
However, despite the rapid increase in vorticity, energy, as 


calculated on an 80° grid, had been conserved in the flow. 


Chorin (1982), simulating the evolution of a turbulent 
vortex, observed that the constraint of energy conservation 
forced the filament into folds or tight tangles. In this 
simulation we observe principally pairing of opposite signed 
vorticity, where sections are aligned and very close, between 
either the same or both filaments, see Fig. l. 


In these dipole pairs the contributions of their vorticity 
to the velocity fields away from their location almost cancel 
and so they only have a small net effect. They are also partly 
responsible for the observed rapid stretching of the filaments 
as they move through the fluid, with their own induced 
velocities, away from the unpaired sections. 
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T = 0-398 | 
X— | 
Fig. 1 Three dimensional representation of configuration of | 
two initially perpendicular vortex filaments at 
t = 0.398 for o constant. 
Filament 1 initially at (0,y,-0.05) 
_ Filament 2 initially at (x,0,0.05) 


= Direction of vorticity vector 


3.3 Accuracy of solution 


Vortex filament methods do have limitations (Leonard, 1985) { 
and are unlikely to be accurate once the radius of curvature 
or separations of filaments are less than o. In Ballard (1985) 
it was found that, when different smoothing functions were used 
in the above simulation, the solutions diverged once separations 
became less than o. However, all solutions showed evidence of 
dipole pairing. Runs where o was allowed to reduce as the 
segments stretched indicated that the use of a constant value 
of o underestimated the rate of increase of vorticity. The 
process of dipole Pairing occurred earlier and more markedly 
as shown in Fig. 2. Obviously further numerical, theoretical 
or analytical calculations are required to produce a rigorous 
justification of dipole pairing but logical arguments and 
observations can be put forward in favour of the feature and 
are discussed in the next section. 
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T = 0-1887 


Fig. 2 As Fig. l but at t = 0.1887 for o varying with local 
segment stretching. 


Filament 1, — — — Filament 2 


Direction of vorticity vector 


4. SUGGESTED PARAMETRIZATION OF EFFECTS OF VISCOSITY 


The effects of viscosity in producing diffusion of vorticity 
can be treated by including viscous fattening of the core in 
the calculations, (Leonard, 1980), (Shirayama and Kuwahara, 
1984) or by applying a random walk to markers on the filament 
at each time step, (Chorin, 1980), (Anderson and Greengard, 
1985). However, the above methods cannot deal with the 
observed merging and relinking of vortex rings, see (Fohl and 
Turner, 1975), (Oshima and Asaka, 1977), or trailing vortices, 
(Sarpkaya, 1983) if whole filaments are tracked. 


Assuming that dipole pairing is a real feature of the flow 
this suggests a plausible method of Pparametrizing the effects 
of viscosity in vortex filament calculations by removing any 
dipole paired sections of filaments and relinking the remainder. 
This is justified because in these regions the diffusive 
effects of viscosity will be most effective since the vorticity 
density and hence velocity gradients are very high. The two 
cores of opposite signed vorticity can be expected to diffuse 
into each other and result in zero net vorticity. They are 
also associated with highly stretched vorticity and hence the 
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smallest scales of motion. It is also computationally 
advantageous as fewer segments are required to calculate the 
velocity field. 


The need to change the topology of closely interacting 
vortex filaments has been widely recognised and Leonard (1975) 
implemented a reconnection method as did Schwarz (1982) in 
the collisions of vortex filaments in superfluid helium. 


5. SIMULATION OF COLLIDING VORTEX RINGS 
5.1 Experimental Observations 


Fohl and Turner (1975) observed the collisions of two 
identical vortex rings in water at high Reynolds numbers and 
found that the details of the interactions depended on the 
angle between their initial paths, the collision angle ġ. If 


¢ was less than 32° the two rings would normally merge and 
remain as one. For larger angles the two rings merged and then 
eventually separated to move at right angles to the original 
Paths. 


5.2 Numerical Method 


The same vortex method was implemented but using a smoother 
functiony(a) and with the rings in free space so that 


(a) = tanh(|4]3/o3) for all a (5.2.1) 


The rings had the same proportion of radii and separation as in 
the experiments, with 60 segments per ring. A value of I was 
used to produce approximately the observed velocity scaled to 


the radii with o = 1.5 tj kept constant. A simple segment 


removal algorithm was implemented removing segments if the 
Separation of their mid points was less than a and the angles 
between the vorticity vectors greater than@ o This involves 


detailed book-keeping. 
5.2 Results 


Choosing a = 0.02 and 8, = 175°, the parametrization worked 
in simple situations and, for example, correctly predicted the 


behaviour when $= 40° and 20° as shown in Figs. 3 and 4. 
However, problems can occur due to the generation of short 
wavelength structure along the filaments and a more 
Sophisticated parametrization and vortex method should be used. 
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T=0-0 T= 02 


Fig. 3 Interaction of 2 vortex rings, ġ = 40 


Filament 1, — — — Filament 2 


ae 
g 


T=0-2 T=1-0 


Fig. 4 As Fig. 3 but with ¢ = 20 
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T=0:3 


Fig. 5 Comparison of inviscid vortex method (left) with 
segment removal parametrization of viscosity (right) 


for $ = 40° 
Filament 1, — — — Filament 2 


Fig. 5 shows the comparison of the configurations with and 


without segment removal for $ = 40° at t = 0.3. As can be 
seen the large scale structure is unaffected by the presence 
of the dipole pairs in the plane symmetry. 


6. CONCLUSIONS 


The observation of dipole pairing in inviscid vortex 
filament calculations is consistent with observations of 
filament relinking, although the close interactions are 
mistreated by the vortex method. This mechanism may be 
important in the stretching of vorticity whilst conserving the 
energy of the flow and suggests a method for parametrizing the 
effects of viscosity in the flow. 


Any method of including viscosity in vortex methods and 
also finite difference models should be able to reproduce the 
observed merging and relinking of vortex filaments. 
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AN EFFICIENT BOUNDARY-INTEGRAL METHOD FOR 
STEEP UNSTEADY WATER WAVES 


J.W. Dold and D.H. Peregrine 
(School of Mathematics, University of Bristol) 


1. INTRODUCTION 


A new method for computing the unsteady motion of a water 
surface, including the overturning of water waves as they break, 
has been developed. It is based on a Cauchy theorem boundary 
integral for the evaluation of multiple time-derivatives of the 
surface motion. The integrals are cast into the form of full 
matrices which can be solved iteratively for the instantaneous 
motion of discrete surface particles. The size of each time- 
step is determined to maintain a specified order of accuracy by 
using a truncated Taylor-series to perform explicit time- 
stepping. The numerical implementation of the method is 
efficient and accurate. For sufficiently large time-steps a 
growing "sawtooth" instability appears in the region of greatest 
point density, but the method is found to be stable for small 
enough time steps. A deliberate reduction in time step size to 
restore stability without smoothing normally only becomes 
necessary with the formation of the jet of a breaking wave where 
surface particles tend to become strongly concentrated. 


Boundary-integral methods can be used to reduce the 
calculation of the inviscid incompressible irrotational motion 
of a body of fluid to the evaluation of the motion of its 
surface alone. In a numerical scheme it is possible therefore 
to solve for the fluid motion using only a point discretisation 
of the surface, thereby substantially reducing the number of 
unknown variables in the problem. This approach has been 
implemented by Longuet-Higgins and Cokelet (1976), Vinje and 
Brevig (1981), Baker, Meiron and Orszag (1981) and Roberts 
(1983), using a number of different integral formulations to 
model the motion of two-dimensional gravity waves on water. 


In order to obtain accurate solutions for complicated 
surface motions, ranging from breaking waves to instabilities in 
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a train of travelling waves it is, nevertheless, necessary to 
use fairly large numbers of surface points. (For example, a 
steepening wave in shallow water, part of which is shown in 
Fig. 1 was computed using up to 200 points). This involves 
running times per time step which vary as the cube of the 
number of points for algorithms which solve the integral 
equation using matrix inversion or factorisation methods, or as 
the square, using iterative techniques. It is therefore 
important to ensure computational efficiency particularly for 
more complicated wave phenomena. 


STENTS SN NNT NNN NNT a S S S S 


Fig. 1 Nonlinear steepening of an initially gently sloping 
surface between uniform levels, y = 1 and y = 2.25. 
The dotted line marks the path of a surface particle. 
Only about 20% of the computed region is shown. 


2. MODEL AND ALGORITHM: 


With the fluid velocity determined by the gradient of a 
velocity potential $, Laplace's equation is satisfied in the 
body of the fluid, 


vp = 0. (2.2) 


With the values of ọ known on the surface, r = R(E); equation 


(2.1) can be solved to give the gradient of $ on the surface. 
Hence the kinematic boundary condition, 


DR 


be Vo (2.2) 


and the dynamic boundary condition, given by Bernoulli's 
equation, 
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2 


u 
Dea el 
DEA, G + gy) (2.3) 
(in which the pressure P may be taken to be constant on the 
surface) provide the basic information for time-stepping the 
surface motion. 


Equation (2.1) is not only satisfied by ¢, but also by 
derivatives of $, e.g. a beer etc. Since Bernoulli's equation 


in the form, 2 
SORE 
ETA A or) (2.4) 
gives o on the surface once Vọ is known, solving Laplace's 
equation for Vo. in the same way can be used to find the second 


time derivatives of R and 4. In principle the procedure can be 
carried further by using appropriate derivatives of equations 
(2.1) to (2.4) to calculate any order of time derivative of R 
and ġ. Using truncated Taylor series to perform the time- 
stepping this additional information improves efficiency by 
permitting relatively large time-steps to be made while 
maintaining a desired degree of accuracy. 


If the fluid is considered to be two dimensional with an 
impermeable flat bottom at y = -h, then the fluid may be 
considered to extend downwards to a reflection of the surface in 
the bottom. In complex notation, with xtiy = R(Ẹ) on the 
surface (R being complex-valued), the condition 


q(R*-2ih) = q*(R) a q=o, - i¢ (2.5) 


must be satisfied, where * denotes the complex conjugate. With 
this definition the complex velocity q is an analytic function 
of xtiy in the region between the fluid surface and its 
relection. 


3. BOUNDARY INTEGRALS 


The arclength gradient (8 of > is easily determined since 


the values of $(&) are known and their rate of change along the 
known surface, x = R(&), can be found. There are three forms 
of boundary integral method available for determining the 
normal gradient oe of ¢. 
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l. The use of Green's indentity, 


= $ [$G - Ghlas (3.1) 


together with the Green's function, 1n|x-x' lie provides one 
approach. This method, however, is not easily solved by 
iterative means so that matrix inversion or L-U factorisation 
has been used by Longuet-Higgins and Cokelet (1976) and by 
New, McIver and Peregrine (1985). The resulting penalty of 
running times which increase as the cube of the number of 
points is severe for large numbers of points. 


2. In a slightly different formulation of the problem, the 
surface can be considered to form a vortex sheet with potential 
flow on either side of the sheet as in (Baker et al., 1982) and 
(Roberts, 1983). Although the integral equation formed by this 
method can be solved iteratively, solving for higher time- 
derivatives is not as straightforward. 


3. Since the complex velocity q(x+iy) is analytic it is 
possible to make use of Cauchy's integral theorem, as in (Vinje 
and Brevig, 1981). This can be cast into the following 
integral equation involving principal values, 


Zz Z 
= peel Teta S "ds! 
Li f, Im TEA $588 + { Re e] ¢as (3.2) 


where the arclength s is measured in a clockwise sense, the 
closed contour C surrounds the fluid, and (9 is the inward 


normal gradient of ġ. This equation can be solved iteratively 
and can be used to calculate the gradients of higher 
derivatives of $. 


Because of these advantages the latter method was chosen as 
the basis of a numerical scheme for following the evolution of 
the water surface. 


4. APPLICATION OF CAUCHY'S THEOREM: 


Assuming for simplicity that the surface is periodic, and 
that time and space dimensions are suitably scaled to make this 
period exactly 2m in x then the infinite fluid surface can be 
transformed into a finite closed contour by a conformal 
transformation, giving: 

PA ei R( &) 5 


R(E) (4.1) 
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Applying the formula (3.2) to this contour and the 
transformation of the reflected surface leads to the following 
integral equation: 


ve % 
"h -$m C E AA 


Q R 
+ | SEN - un: — |pracs (4.2) 
Q-e.~/Q'* 


where oe is simply the derivative of $ with respect to € along 
the surface, while by is the outward normal gradient of Ci) 
scaled by [2,|. Having solved for gs the complex potential 
gradient of $ in the physical plane is given by 


aig SE 
$y + ae =i a (g + ig,)- (4.3) 


It can be noted that the term arising from the bottom 
condition in the integral kernels of equation (4.2) does not 
vanish as h > œ, The limit, 2/2, however makes no overall 


contribution to the integrals. It can therefore be subtracted 
out so that the term arising from the bottom condition can be 
altered to the following: 

2h 


e ,./Q'* 


- (4.4) 
2(Q-e P/a" +) 


This makes a considerable difference to the diagonal dominance 
of the matrices in equation (5.3) below. The iterative 
behaviour improves from being divergent, when this alteration 
is not made, to being convergent. 


5. NUMERICAL SOLUTION 


In order to solve equation (4.2) numerically it is now 
convenient to identify — asa point-label parameter taking 
integer values for points R(&) on the surface. By considering 
a Taylor expansion for 2' it can be seen that 


Q 
E a EE am 
ms pen a a OEE (5.1) 


m 
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Thus only the real part of Re / (R=!) is genuinely singular. 


Moreover, since 


=- 9 (5.2) 

Bre 
the effect of this singularity is easily taken into account. 
Because of the periodic nature of problem, the best available 
quadrature formula representing the integrals of equation (4.2) 
can be shown to give 


mo (E) = = A(E,E')O,(8") 7 bgg (8) + E B(E,E')$ (E) 


(553) 
where 
2 
tabi -E f 
nen tS AE 
ona n yar 
A ABE A ae E a Ala (5.4) 
Q(Q-e — /Q'*) 
2 
EE 
rr aA aie ae ks 
2; 


A program was written using this formula to solve for the 
potential gradients. Given a smooth periodic discretisation of 
the surface R(€) the transformation (4.1) is made and the 
matrices of equation (5.3) are calculated. With $(€) defined 
on the surface the same matrices are used to directly calculate 
up to the third Lagrangian time derivatives of R and ¢ by the 
method outlined between equations (2.1) and (2.5). After two 
time-steps quadratic backward differencing is used to estimate 
up to the fifth time derivative and Taylor series are used to 
perform explicit time-stepping . The size of time-step for this 
is calculated as the geometric mean of the time-steps which 
would make the maximum contributions to the Taylor series of the 
third and fourth derivatives less than a given small precision 
parameter £. 

6. STABILITY AND ACCURACY 


A particular problem which has been reported with other 
methods is the generation of a "sawtooth" numerical instability 
which has to be kept under control using smoothing techniques. 
Even for very steep waves this instability appears to be absent 
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from this method when time step sizes are sufficiently small. 
For step sizes greater than some critical value (which depends 
on point density) sawtooth instability does appear. It is 
first noticeable as a localised "roughness" in the highest 
derivative calculated while the corresponding roughness in the 
profile is still extremely small. A correspondingly small 
degree of smoothing of this profile can be used to keep the 
instability in check although the critical time step is not 
normally found to be restrictive and smoothing is not normally 
used. 


As the jet of a breaker develops the points near the tip 
tend to move together and become more densely distributed. 
Under such circumstances the sawtooth instability is found to 
become important. It can be completely controlled without 
smoothing by reducing the time step size as necessary. This, 
however, leads to an increase in running time so that it is 
generally more efficient to use smoothing techniques even 
though some information is lost in doing so. 


It is found that the ability to accurately calculate higher 
and higher derivatives is eventually limited by the adequacy 
of the discretisation of the profile. In regions of large 
surface extension, such as beneath the jet of a breaking wave 
where surface particles are drawn apart, a particle-following 
time-stepping technique leads to poor resolution of the motion 
and the highest derivatives are the first to show signs of 
inaccuracy. 


7. SPEED 


The running time is empirically found to vary in the 
proportion, 


C.P.U. time = otn? E73) (7.1) 


with an accuracy of about cero? after T wave periods, provided 
the number of points N is adequate to describe the surface. 
For more complicated surfaces the running time thus increases 
as the square of the number of points required to describe the 
surface. For wave propagation covering many periods 
proportionally smaller values of £ are required to maintain a 
given final accuracy. Since the running times are found to 


vary more-or-less as ag) the penalty for this is not great. 
A reduction in € by a factor of 10 roughly doubles the running 
time. 
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An example is given by the wave in Fig. 1, showing part of 
a computed wave profile. This was calculated on a computer 
operating at about 80,000 floating point operations per second. ` 
Using 120 points the program calculated up to the time, t = 6.5, 
in 7 minutes of C.P.U. time. It took only 21 time steps to 
reach this stage with an overall accuracy of about 0.2% showing 
the effect of calculating higher derivatives. For comparison, 
Baker, Meiron and Orszag (1981) quote a time of about 30 
seconds on a Cray-l (equivalent to about 10 hours on our 
computer) for a similar calculation using 128 points and 400 
time-steps. The remainder of the wave motion shown in Fig. l. 
was calculated with 200 points using both smoothing and 
progressive reduction in step-size to control the instability. 
A further 47 time steps were made, taking 50 C.P.U. minutes. 


8. CONCLUSIONS 


Cauchy's integral theorem is used to set up a boundary 
integral equation relating the normal potential gradient with 
the tangential potential gradient at a periodic wave surface. 
Discretisation of the integral kernels leads to quadrature 
formulae involving full matrices in a form suitable for 
iterative solution. The same matrices are used to calculate 
the velocity of surface particles as well as acceleration and 
the rate of change of acceleration. The technique can be 
successively continued to calculate any higher time derivatives 
of the surface motion. Taylor series are used to explicitly 
step the wave surface forward in time using fairly large time- 
step sizes for a given accuracy. 


The combination of relatively large time steps and iterative 
solution of the boundary-integral equation leads to a numerical 
scheme which is fast and accurate. The scheme is efficient 
enough to be able to compute lengthy and complicated surface 
motions with only moderate computer resources. 
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